May 2026 Product Update

This month's updates bring new templates and guides, major upgrades to two-factor authentication, a benchmarking tool to help you pick the right GPU for your workload, and a complete creative AI studio that runs in a single container.
NVIDIA Cloud GPU Updates
We've introduced several platform-level improvements this month. Two-factor authentication (2FA) now supports authenticator apps like Google Authenticator and 1Password instead of relying solely on SMS. Additional upgrades include CLI 2FA support, automatic backup codes for account recovery, multiple authentication methods per account, and 2FA for team roles and API key permission groups.
Note: Action is required if you use legacy SMS 2FA. Simply log in normally, go to the Settings page, and click "Regenerate" on the backup codes section to migrate to the new 2FA in less than a minute.
For developers, the Vast pip package has been upgraded, and the SDK and CLI repositories are now merged into a single, easy-to-grab repo. We've also introduced an agent skill for coding assistants like Claude and Codex to drive our CLI more effectively.
Highlighted Feature: Benchmark Before You Deploy
Choosing the right production GPU is now faster and easier, without any guesswork involved. Vast's benchmarking CLI will take your workload and test it on H100s, A100s, 5090s, 4090s, or any selection you define to compare performance and cost efficiency across GPU classes.
The CLI rents one instance of each GPU type you specify, or a default set for a reasonable starting point if you don't specify any, and then runs the template's built-in benchmark workload, reports performance per dollar, and tears down each rental automatically when finished. Multiple GPU types can be tested in parallel, making it much faster to evaluate your options before committing to longer rentals or production traffic.
Because each run uses real instances and consumes account credit, you get measured performance and cost per hour on actual hardware, not estimates. This benchmarking tool is built for the moment you're staring at a pricing page, wondering whether an H100 is actually worth 4x the cost of a 5090. Sometimes it is, and sometimes it isn't. Now you can find out before you deploy.
New Templates and Guides
We've rolled out another batch of templates and guides to support creative workflows, fine-tuning, experimentation, and multimodal reasoning.
The standout addition this month is the All-in-One App Studio template. It packs eight creative AI applications and a GPU-accelerated remote desktop with KDE Plasma and Blender into a single container, giving you a complete AI production environment without having to manage separate instances.
New Templates
- All-in-One Creative AI Studio - One GPU instance: image generation, video, music, voice, transcription, LoRA training, LLM fine-tuning, and KDE desktop with Blender, each activated only when you need it.
- Unsloth Studio - No-code UI for running and fine-tuning 500+ open-source models with 70% less VRAM.
- Kimi K2.6 - 1T-parameter MoE, 32B active, multimodal agentic model with 256K context.
- Qwen3.5 27B and Qwen 3.5B-A3B - MoE vision-language models with unified multimodal reasoning and 256K context.
- Gemma 4 26B-A4B and Gemma 31B IT - Google's vision-language models with thinking mode and 256K context.
- Autoresearch - Let Claude Code run ML experiments for you, now in a template.
New Guides
Other Improvements
This update also brings fleet-wide mitigations for the recently disclosed CopyFail exploit. No action is required from users, and running workloads remain unaffected.
Our Commitment
As other clouds are out of GPU capacity or charging a premium for what's left, we remain focused on making high-performance AI infrastructure available, affordable, and easy to deploy at scale.
Need help? Contact us anytime at support@vast.ai or join our Discord server for tips, community support, and platform updates in real time.
Change Log
New Features
- Benchmarking CLI for GPU performance and cost comparison.
- All-in-One App Studio: eight creative AI tools in one GPU instance.
- Authenticator app support: use any TOTP app, including Google Authenticator, 1Password, and Microsoft Authenticator, instead of SMS.
- CLI 2FA: 2FA now works through the Vast CLI.
- Backup codes: automatically generate one-time recovery codes so you're never locked out.
- Multiple authentication methods: register more than one per account.
- 2FA for team roles and API key permission groups.
Set up and enable 2FA in Settings -> Full setup guide.
Issues Resolved
- CopyFail exploit: fleet-wide mitigations rolled out with no impact on running workloads and no user action required.
- SSH key input validation, error message, and link fixes.
- Cloud copy fixes.
- Mobile UI fixes.
- Notifications fixes.
- General proactive security fixes.
API Changes
- The Vast pip package has been upgraded, and the SDK and CLI repos have been merged into a single, easy-to-grab repo. Use:
pip install vastai. - Install our agent skill to help Claude, Codex, etc., drive our CLI:
npx skills add vast-ai/vast-cli.
Updated Templates
- Wan2GP - migrated from provisioning installer to Docker.
- Whisper UI - migrated from provisioning to Docker image.
- FluxGym - Dockerized to prevent provisioning failures.
- vLLM - updated build actions for new CU130 image tag.


