Rent NVIDIA H200 Tensor Core GPUs Now
Get immediate access to the world's most advanced GPU on your terms - from a single GPU to clusters of thousands. Vast.ai makes it easy to rent the H200 GPUs you need at unbeatable prices and on flexible terms.

Meet The NVIDIA H200:
The World's Most Advanced GPU For AI & Inference
The NVIDIA H200 is the most powerful GPU on the planet—built for the next generation of generative AI, deep learning, and high-performance computing. Featuring cutting-edge Hopper architecture and available in both PCIe and SXM configurations, the H200 delivers record-setting memory bandwidth and lightning-fast inference speeds. Put simply: the H200 supercharges Generative AI and high-performance computing workloads with game-changing performance, speed, and memory capabilities.
Vast.ai's H200 GPUs Offer
Unmatched Performance for AI & ML Applications
According to NVIDIA Research, the H200 is the first GPU with HBM3e. This larger, faster memory powers the acceleration of generative AI and LLMs while advancing scientific computing for HPC workloads. It is, quite simply, the gold standard for the world's most cutting-edge applications.
Upgraded Memory: About 76% More Memory vs. H100
The NVIDIA H200 GPU features 141GB of HBM3e memory, roughly 76% more than the 80GB H100. The added capacity lets you load larger models into GPU memory or run larger batch sizes, for faster and more efficient training.
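The capacity comparison is simple arithmetic. The Python sketch below is a rough illustration, assuming the 80GB H100 as the baseline and 2 bytes per parameter for FP16/BF16 weights; both figures and the helper names are assumptions for illustration, not Vast.ai tooling.

```python
H200_MEMORY_GB = 141  # from the text above
H100_MEMORY_GB = 80   # assumption: the 80GB H100 is the baseline


def percent_increase(new: float, old: float) -> float:
    """Relative increase of `new` over `old`, in percent."""
    return (new - old) / old * 100.0


def weights_size_gb(params_billions: float, bytes_per_param: float = 2.0) -> float:
    """Approximate size of the model weights alone, in GB (1 GB = 1e9 bytes).

    Assumption: 2 bytes per parameter (FP16/BF16). Activations, optimizer
    state, and KV cache are NOT included, so real usage is higher.
    """
    return params_billions * bytes_per_param


def headroom_gb(params_billions: float, gpu_memory_gb: float) -> float:
    """Memory left on one GPU after loading the weights (negative = does not fit)."""
    return gpu_memory_gb - weights_size_gb(params_billions)


increase = percent_increase(H200_MEMORY_GB, H100_MEMORY_GB)
print(f"H200 vs. H100 memory: {increase:.2f}% more")

# A hypothetical 70B-parameter model in FP16 on a single GPU of each type.
print(f"70B headroom on H200: {headroom_gb(70, H200_MEMORY_GB):.0f} GB")
print(f"70B headroom on H100: {headroom_gb(70, H100_MEMORY_GB):.0f} GB")
```

Under these assumptions, the weights of a 70B-parameter model only just fit on a single H200 and do not fit on a single 80GB H100, so real deployments would still need quantization or multiple GPUs to leave room for activations and KV cache.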
Enhanced Performance: 1.4x the HBM3e Memory Bandwidth
The NVIDIA H200's 4.8TB/s memory bandwidth is roughly 1.4x that of the H100 (3.35TB/s on the SXM version). Higher bandwidth keeps the GPU's compute units fed with data, which is critical for the growing data sets and model sizes of today's frontier LLMs.
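To see why bandwidth matters for LLM serving, here is a rough Python sketch. It assumes the H100 SXM figure of 3.35TB/s as the baseline and uses a common back-of-the-envelope rule: when decoding is memory-bound, each generated token requires reading all the weights once, so bandwidth divided by weight size gives an upper bound on single-stream tokens per second. The function names and the 140GB model size are illustrative assumptions.

```python
H200_BANDWIDTH_TBPS = 4.8   # from the text above
H100_BANDWIDTH_TBPS = 3.35  # assumption: H100 SXM memory bandwidth


def bandwidth_ratio(new_tbps: float, old_tbps: float) -> float:
    """How many times higher `new_tbps` is than `old_tbps`."""
    return new_tbps / old_tbps


def max_decode_tokens_per_s(bandwidth_tbps: float, weights_gb: float) -> float:
    """Rough upper bound on single-stream decode speed.

    Assumption: every token reads all weights from memory exactly once and
    nothing else limits throughput (ignores KV cache reads, compute, and
    batching), so real numbers will be lower.
    """
    return bandwidth_tbps * 1000.0 / weights_gb  # TB/s -> GB/s


ratio = bandwidth_ratio(H200_BANDWIDTH_TBPS, H100_BANDWIDTH_TBPS)
print(f"H200 bandwidth is {ratio:.2f}x the H100's")

# Hypothetical model with 140 GB of weights (e.g. 70B parameters in FP16).
bound = max_decode_tokens_per_s(H200_BANDWIDTH_TBPS, weights_gb=140)
print(f"Upper bound on H200: {bound:.1f} tokens/s per stream")
```

The bound scales linearly with bandwidth, which is why a 1.4x bandwidth gain can translate fairly directly into faster token generation for memory-bound workloads.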
Faster Access: 6x Faster Access Speed vs. the H100
The H200 GPU boasts read speeds of up to 20GB/s from one node with the shared filesystem - a 6x improvement vs. the H100 GPU. This is crucial for efficient training of today's LLMs, as well as for inference-related tasks.
Now you can rent H200 GPUs on Vast.ai's intelligent cloud GPU platform, purpose-built to give you access to market-leading GPUs, unparalleled performance, faster speeds, and radically lower prices.
Experience Next-Level Performance with the NVIDIA H200
AI inference at scale demands high throughput and low cost. The H200 delivers up to 2x faster inference on large language models like Llama 2 compared to the H100, making it one of the most efficient options for serving LLMs to large user bases.
Llama 2 70B Inference: 1.9x Faster (vs. H100)
GPT-3 175B Inference: 1.6x Faster (vs. H100)
High-Performance Computing: 110x Faster (vs. CPUs)
“Vast.ai is, quite simply, the best cloud compute provider out there. We've tried them all, but Vast is the only one we stay with. Their entire experience - from the ease of renting GPUs to the cost-effective pricing, incredible support and unbeatable pricing - is absolutely fantastic.”
Accelerate Your Use Cases with the H200 on Vast.ai
Training massive LLMs like GPT, LLaMA, Mixtral, and Falcon
Fine-tuning vision transformers and diffusion models
Drug discovery and scientific simulations
Generative AI for text, audio, video & code
Real-time inference at hyperscale
Why Vast.ai Pricing Works
Massive Cost Savings
Pay 5x-6x less than on traditional cloud compute platforms.
Transparent Pricing
No hidden fees. You pay only for what you use.
Instant Access
Rent H200s in minutes, with no waitlists, no sales calls & no delays.
Global Marketplace
Choose from providers worldwide, with granular control.
Custom Configs
Filter by CPU, RAM, bandwidth, location, and more.
Automated Optimization
Vast.ai's intelligent provisioning ensures the best performance per dollar.
Getting Started
Getting started on Vast.ai is fast, simple, and fully self-serve. Whether you're running a quick experiment or scaling a production-grade AI workload, you can launch H200 GPUs in just a few clicks—no sales calls, no contracts, no delays.
Renting H200 GPUs on Vast.ai is As Easy As 1-2-3
One: Search
Use our powerful filters to find the right H200 instance based on price, location, specs, and provider reputation. Sort by cost-performance ratio, bandwidth, CPU cores, RAM, and more.
Two: Deploy
Launch instantly with our pre-configured templates, or bring your own image. Every instance comes with built-in SSH, Docker, and Jupyter support—so you can get to work right away.
Three: Scale
Need more compute? Add more instances in seconds. Done with a job? Shut it down with one click. You're always in control—with flexible billing and zero lock-in.
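The search step boils down to filtering offers and ranking them by cost. The Python sketch below illustrates that idea on a small in-memory list. The `Offer` fields, the `find_offers` helper, and all sample values are hypothetical placeholders for illustration; they are not the Vast.ai API or real prices.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Offer:
    """Hypothetical shape of a marketplace offer (not the real Vast.ai schema)."""
    offer_id: int
    gpu_name: str
    num_gpus: int
    price_per_hour: float  # USD per hour for the whole instance
    reliability: float     # provider reliability score, 0.0 to 1.0
    region: str

    @property
    def price_per_gpu_hour(self) -> float:
        return self.price_per_hour / self.num_gpus


def find_offers(offers, gpu_name, max_price_per_gpu_hour, min_reliability=0.95):
    """Keep offers matching the filters, cheapest per GPU-hour first."""
    matches = [
        o for o in offers
        if o.gpu_name == gpu_name
        and o.reliability >= min_reliability
        and o.price_per_gpu_hour <= max_price_per_gpu_hour
    ]
    return sorted(matches, key=lambda o: o.price_per_gpu_hour)


# Made-up sample data, purely to exercise the filter.
SAMPLE_OFFERS = [
    Offer(1, "H200", 1, 3.00, 0.99, "US"),
    Offer(2, "H200", 8, 20.00, 0.97, "EU"),
    Offer(3, "H200", 2, 5.00, 0.90, "US"),     # filtered out: low reliability
    Offer(4, "H100", 1, 2.00, 0.99, "US"),     # filtered out: wrong GPU
    Offer(5, "H200", 4, 16.00, 0.98, "ASIA"),  # filtered out: over price cap
]

results = find_offers(SAMPLE_OFFERS, "H200", max_price_per_gpu_hour=3.50)
for offer in results:
    print(offer.offer_id, offer.region, f"${offer.price_per_gpu_hour:.2f}/GPU-hour")
```

Ranking by price per GPU-hour rather than by instance price keeps single-GPU and multi-GPU offers comparable.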
A Cloud Compute Rental Platform Built For Developers
Spin Up in Minutes
Vast.ai abstracts away complexity. You don't need to be a DevOps wizard to spin up an instance in minutes.
Built-In SSH, Jupyter & Docker
Integrated, best-in-class tools so you can begin working on your application faster.
API + CLI Access
Power users get full automation and control.
Flexible Billing
On-demand, interruptible or reserved pricing models that suit your needs.
No Lock-in
Cancel anytime. Vast is built to help you, not trap you.
Rent NVIDIA H200 GPUs Today
Get started with the world's most advanced GPU. No waitlists, no contracts, no delays.