Question 1

What is Vast.ai Serverless?

Accepted Answer

Vast.ai Serverless is an on-demand GPU platform that auto-scales across Vast.ai's marketplace, which spans consumer GPUs to H100 clusters, and automatically routes each job to the machine class and price point best suited to it. Billing is per second at raw marketplace GPU rates, with no serverless premium, contracts, quotas, or upfront commitments.

Question 2

How does Vast.ai Serverless pricing work?

Accepted Answer

Vast.ai Serverless bills per second at raw marketplace GPU rates, with no serverless surcharge, pricing tiers, or hidden fees. Workloads are automatically routed across the marketplace's GPU classes and price points to find the best cost-performance fit for each job. You can get started with a minimum balance of $5.
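To make per-second billing concrete, here is a minimal arithmetic sketch. The hourly rate and runtime are hypothetical numbers chosen for illustration, not actual Vast.ai prices, and the function name `per_second_cost` is our own. The hourly-rounded figure is included only as a contrast with platforms that bill in whole hours.

```python
import math


def per_second_cost(hourly_rate_usd: float, runtime_seconds: float) -> float:
    """Cost of a job billed per second at a given hourly GPU rate."""
    return hourly_rate_usd / 3600 * runtime_seconds


# Hypothetical values for illustration only (not real marketplace prices).
rate = 1.80    # USD per GPU-hour
runtime = 90   # job runtime in seconds

cost = per_second_cost(rate, runtime)

# For contrast: what the same job would cost if billed in whole hours.
hourly_rounded = math.ceil(runtime / 3600) * rate

print(f"per-second billing: ${cost:.4f}")
print(f"whole-hour billing: ${hourly_rounded:.2f}")
```

Under these assumed numbers, a 90-second job is charged for 90 seconds of GPU time rather than a full hour, which is where short, bursty workloads tend to save the most.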

Question 3

What GPUs are available on Vast.ai Serverless?

Accepted Answer

Vast.ai Serverless offers access to more than 68 GPU types across 500+ global locations, including popular options such as the RTX 5090, NVIDIA H100, NVIDIA B200, and RTX PRO 6000. The platform automatically matches your workload to suitable GPU hardware.

Question 4

How do I deploy a serverless endpoint on Vast.ai?

Accepted Answer

You can deploy a Vast.ai serverless endpoint from the dashboard, CLI, API, or directly in Python with the Vast SDK. The @remote decorator lets you run Python functions as serverless GPU jobs without building HTTP wrappers or managing GPU infrastructure manually.
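The decorator approach can be illustrated with a small sketch of the general pattern. Everything here is an assumption for illustration: the `remote` decorator below is a local stand-in defined in the example itself, not the Vast SDK's actual `@remote`, and the `gpu` parameter and `embed` function are hypothetical. The SDK documentation is the place to check the real import path and arguments.

```python
import functools


def remote(gpu: str):
    """Hypothetical stand-in for a decorator like @remote (NOT the Vast SDK)."""
    def wrap(fn):
        @functools.wraps(fn)
        def call(*args, **kwargs):
            # A real SDK would presumably package the call, send it to a GPU
            # worker, and return the result. This stand-in runs it locally.
            return fn(*args, **kwargs)
        call.gpu = gpu  # record the requested hardware on the wrapper
        return call
    return wrap


@remote(gpu="H100")  # hypothetical parameter name and value
def embed(texts):
    # Placeholder workload: a real job would run a model on the GPU.
    return [len(t) for t in texts]


# The call site looks like an ordinary function call, with no HTTP wrapper.
result = embed(["hello", "gpu"])
print(result, embed.gpu)
```

The point of the pattern is that the caller writes a plain Python function and a plain function call, and the decorator is the only place that knows about remote execution.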

Question 5

Is my data secure on Vast.ai Serverless?

Accepted Answer

Yes. Vast.ai Serverless workloads run on isolated instances without container sharing. Vast.ai is SOC 2 Type I certified and supports private VPN access, audit trails, and data deletion on request to help meet security and compliance requirements.

Lowest Cost, Autoscaling GPU Cloud on the Market

Where GPU Cloud Meets Serverless

Easy to Use

Transparent Pricing

Access All Hardware

Flexible Regions

Serverless Key Features

Dynamic Scaling

Global GPU Fleet

Fast Cold-Start Times

Metrics and Debugging

Deploy from Python, Not the Dashboard

Custom Worker Types

How Does Vast.ai Stack Up?

Private by Design. Secure by Default.

Full Environment Control

Compliance-Ready

Data Sovereignty

Enterprise Security Features

Predictive Optimization

On-Demand GPU Deployment

Flexible, Transparent Pricing

Secure Cloud Isolation

Dev-First Interfaces

Up-to-date Templates

Support That Doesn't Sleep

Unrestricted Selection & Control

Frequently Asked Questions

From Zero to Compute in Seconds