The NVIDIA RTX Pro 6000 S is a server-optimized Blackwell GPU designed for large-scale AI inference, fine-tuning, and enterprise deployments. With 96 GB of ECC-enabled GDDR7 VRAM and approximately 1.6 TB/s of memory bandwidth, it delivers the capacity, reliability, and sustained throughput required for modern AI systems running continuously in data-center environments.
The RTX Pro 6000 S brings Blackwell-class performance to the Vast.ai marketplace in a server-first form factor. Built for sustained utilization, it supports ECC memory, enterprise drivers, and advanced isolation features, making it ideal for shared infrastructure, production inference, and multi-tenant AI services.
With 96 GB of GDDR7 ECC memory, the RTX Pro 6000 S is designed to handle large language models, multimodal pipelines, and high-batch inference workloads that exceed the limits of consumer GPUs. Run 70B-80B+ parameter models, long context windows, and concurrent services without constant memory pressure or instability.
The RTX Pro 6000 S supports Multi-Instance GPU (MIG), allowing a single GPU to be securely partitioned into multiple isolated instances. This makes it possible to run multiple inference services, isolate tenants, or separate production and experimental workloads — all while maximizing utilization and uptime.
The RTX Pro 6000 S supports Multi-Instance GPU (MIG), allowing a single GPU to be securely partitioned into multiple isolated instances. This makes it possible to run multiple inference services, isolate tenants, or separate production and experimental workloads all while maximizing utilization and uptime.
Large-scale inference and fine-tuning demand consistency, memory headroom, and sustained performance. The RTX Pro 6000 S is purpose-built for production AI systems, delivering high throughput and low latency across demanding workloads running around the clock.
22% faster
1.45x faster
2.5x faster
“Vast.ai is, quite simply, the best cloud compute provider out there. We've tried them all, but Vast is the only one we stay with—especially for ad-hoc PRO 6000 WS capacity. Their entire experience is absolutely fantastic.”- CTO, AI Solutions Inc.
“Switching to Vast.ai reduced our cloud compute costs by over 70%, while giving us more control, better support, and faster access to PRO 6000 WS capacity. I can't recommend them enough.”- AI Research Lead

Save 5x-6x vs. traditional cloud compute platforms.
No hidden fees. You pay only for what you use.
Rent RTX Pro 6000 S GPUs in minutes, with no waitlists, no sales calls & no delays.
Choose from providers worldwide, with granular control.
Filter by CPU, RAM, bandwidth, location, and more.
Vast.ai's intelligent provisioning ensures the best performance per dollar.