
Jobs
Our mission is to organize, optimize, and orient the world's computation.
Join the team building the future of AI.
View Open RolesWhat We're Building
Vast.ai is the AI compute platform — 20,000+ GPUs, 25,000+ monthly customers, and 8 years of operations data. We're building the infrastructure layer where AI agents and developers programmatically provision and manage GPU compute.
We are also developing software to accelerate the training and deployment of complex neural networks on our decentralized infrastructure.
Our Locations
We have offices in Los Angeles and San Francisco.
Vast.ai Los Angeles
1100 Glendon Ave #1840
Los Angeles, CA 90024
Vast.ai San Francisco
100 First Street, #2250
San Francisco, CA 94105
The Work
The journey to our destiny will not be easy. Our goal is not the safe harbor of today. You will be taxed, and you will be pushed. We love to work. We can't help it; we are witnessing the birth of AGI.
All technical roles report to Jake Cannell, the CEO and founder. Jake is a prolific writer and thinker on the subject of AI. Two examples from Less Wrong:
LOVE in a simbox is all you needThe Brain as a Universal Learning MachineOpen Roles
Current openings at Vast.ai
Every opening listed here is managed directly by Vast.ai.
Engineering
5 roles
Open each role for the full description, application details, and submission path.
C++ Software Engineer — Systems
Own performance-critical systems like the daemon orchestrating every host in our fleet — performance, reliability, containerization, and telemetry. On-site in SF or LA.
Expand roleSan Francisco or Los Angeles, CAOn-siteFull-time$120K – $180K
C++ Software Engineer — Systems
Own performance-critical systems like the daemon orchestrating every host in our fleet — performance, reliability, containerization, and telemetry. On-site in SF or LA.
As a Systems Software Engineer, you will work on key performance critical systems such as the daemon which orchestrates every host in our fleet. You will improve the performance, reliability and capability of our infrastructure and containerization technologies including monitoring and telemetry. Tech stack: C, C++, Python, Linux.
What You Will Do
- Expand and extend our GPU cloud daemon
- Design and deploy market-based resource management systems
- Harden code and infrastructure to meet zero-trust standards
- Benchmark, profile, and eliminate bottlenecks across hypervisor, container, and network layers
What We Need
- Programming: Strong programming skills in at least one language, ideally C++
- Linux and Virtualization: Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization
- Isolation Techniques: Deep understanding of workload and network isolation techniques in multi-tenant environments
- Cloud Security: Experience in securing and hardening cloud infrastructure, particularly in environments with untrusted workloads
- Multi-tenant Security: Strong background in workload and network isolation, network security, and cloud-native security practices
- GPU Security: Experience with GPU programming and an understanding of GPU-specific security concerns.
Benefits
- Comprehensive health, dental, vision, and life insurance
- 401(k) with company match
- Meaningful early-stage equity
- Onsite meals, snacks, and close collaboration with founders/tech leaders
- Ambitious, fast-paced startup culture where initiative is rewarded
Application Details
After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: 15 min - Initial screening (virtual); 45 min - Quick dive into Vast, systems and architectures (virtual); 1 hour - LLM-assisted coding assessment (virtual); 2 hours - Meet and greet with coding assessment (on-site). We aim to complete the interview process in about one week.
Apply directly inside Vast.ai.
GPU Systems Engineer – HPC / Parallel Computing
Bring HPC and parallel-programming expertise to AI inference — design and optimize GPU kernels and tensor libraries at the bleeding edge. On-site in SF or LA.
Expand roleSan Francisco or Los Angeles, CAOn-siteFull-time$160K – $320K
GPU Systems Engineer – HPC / Parallel Computing
Bring HPC and parallel-programming expertise to AI inference — design and optimize GPU kernels and tensor libraries at the bleeding edge. On-site in SF or LA.
We’re looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You’ll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI. Tech stack: CUDA/C++, GPGPU, Python, Linux.
What You Will Do
- Design and optimize GPU kernels and tensor libraries
- Translate HPC techniques into scalable AI inference solutions
- Evaluate emerging architectures and resource management approaches
- Collaborate with technical leadership to improve GPU infrastructure efficiency
What We Need
- Advanced C++ (C++17/20 preferred)
- Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar)
- Strong background in systems optimization and HPC performance tooling
Nice To Have
- Familiarity with distributed training/inference frameworks
Benefits
- Comprehensive health, dental, vision, and life insurance
- 401(k) with company match
- Meaningful early-stage equity
- Onsite meals, snacks, and close collaboration with founders/tech leaders
- Ambitious, fast-paced startup culture where initiative is rewarded
Application Details
After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: Initial screening (virtual, 15 minutes); Quick dive into Vast, systems and architectures (virtual, 30 minutes); LLM-assisted coding assessment (virtual, 1 hour); Meet and greet with coding assessment (on-site, 2 hours). Our goal is to complete the interview process in two weeks.
Apply directly inside Vast.ai.
QA Associate
Manual and automated QA for the web apps and backend services behind Vast.ai's GPU marketplace. Onsite 5 days a week in Westwood, Los Angeles.
Expand roleLos Angeles, CAOn-siteFull-time$40/Hr
QA Associate
Manual and automated QA for the web apps and backend services behind Vast.ai's GPU marketplace. Onsite 5 days a week in Westwood, Los Angeles.
We are seeking a highly skilled QA Associate to do manual and automated testing of web apps and backend services in Vast's Linux-first environment. This role is critical to ensure that our complex, always-on, high-traffic systems are reliable and performant. The ideal candidate is both highly technical and sensitive to the detailed needs of our users. This role is onsite 5 days a week in our Westwood, Los Angeles office.
What You Will Do
- Execute manual and exploratory testing for web apps + backend services
- Maintain existing manual test plans and write new plans for features being developed
- Design high-signal test cases and automation
- Test and validate software to ensure that it satisfies requirements and is defect free
- Analyze the root cause for testing failures and open appropriate tickets with sufficient findings
- Collaborate with the Product and Development teams to define acceptance criteria and ship reliable releases
What We Need
- 3+ years hands-on testing of web applications and APIs
- Strong knowledge of test methodologies and their corresponding tools
- Experience with writing test plans and test cases for assigned features
- Experience with test automation and lightweight scripting/coding
- Keen eye for detail
- Proficient with Linux
Nice To Have
- Passionate about the future of AI
- API testing with Postman/Newman or similar
- Containers and orchestration basics (Docker; Kubernetes concepts)
- Experience with load testing tools
- Familiarity with GPUs and GPU drivers—very nice to have, but not required
Benefits
- Work 5 days a week from the Vast.ai HQ in Westwood, Los Angeles in an ambitious, fast-paced, AI-centered startup environment
- Health, dental, vision and life insurance coverage
- Matching 401K
Apply directly inside Vast.ai.
Security Engineer
Offensive and defensive security for a global GPU cloud — secure architecture, assessments, tooling, and compliance (SOC 2, ISO 27001). Onsite in Westwood, LA.
Expand roleLos Angeles, CAOn-siteFull-time$145K – $185K
Security Engineer
Offensive and defensive security for a global GPU cloud — secure architecture, assessments, tooling, and compliance (SOC 2, ISO 27001). Onsite in Westwood, LA.
We are seeking a skilled Security Engineer to join our dynamic team. We hire people with broad skill sets who also exhibit deep expertise. The ideal candidate will have experience in both offensive and defensive security, strong software development skills, and deep knowledge of Linux systems and containerization. This role provides the opportunity to work on cutting-edge GPU cloud technologies, tackle complex security challenges at scale, and directly enhance the resilience and trustworthiness of our infrastructure and services.
What You Will Do
- Collaborate with Operations Team: Partner with our operations team to ensure compliance with relevant standards such as SOC 2, ISO 27001, and GDPR
- Secure Architecture Design: Develop and implement secure architectures for our GPU cloud platform
- Security Assessments: Conduct security assessments, threat modeling, code reviews, and penetration testing
- Security Improvements: Develop and implement security fixes and improvements in collaboration with engineering teams
- Security Tools Management: Implement and manage security tools and systems, including SIEM, WAF, and EDR
- Documentation: Create and maintain security documentation, including policies, procedures, and technical guidelines
- Security Training: Provide security guidance and training to engineering teams to foster a security-first culture
- Incident Response: Participate in incident response activities and contribute to post-incident analysis and improvements
What We Need
- A problem-solver who thrives in a fast-paced environment
- Committed to continuous improvement and staying updated with the latest security practices and cloud technologies
- A team player with strong communication skills, able to bridge the gap between development and security
- Educational: Bachelor's degree in Computer Science, Cybersecurity, or a related field.
- Programming: Strong programming skills in at least one language, ideally Python or C.
- Linux and Virtualization: Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization
- Isolation Techniques: Deep understanding of workload and network isolation techniques in multi-tenant environments
- Cloud Security: Experience in securing and hardening cloud infrastructure, particularly in environments with untrusted workloads
- Network and Application Security: Strong background in network security, application security, and cloud-native security practices
- Security Testing Tools: Experience with security testing tools and methodologies, such as OWASP, Burp Suite, and static/dynamic analysis tools
- Cybersecurity Frameworks: Familiarity with common cybersecurity frameworks, including SOC 2, NIST, and CIS Controls
Nice To Have
- Security Certifications: Relevant security certifications such as CISSP, CCSP, or OSCP.
- DevSecOps Experience: Experience with DevSecOps practices and tools in cloud environments.
- Regulatory Compliance: Familiarity with regulatory compliance requirements for operating cloud services.
- GPU Security: Experience with GPU programming and an understanding of GPU-specific security concerns.
Benefits
- Comprehensive health, dental, vision, and life insurance
- 401(k) with company match
- Meaningful early-stage equity
- Onsite meals, snacks, and close collaboration with founders/tech leaders
- Ambitious, fast-paced startup culture where initiative is rewarded
Apply directly inside Vast.ai.
Senior Infrastructure Engineer
Design and scale the core systems behind Vast.ai's global GPU marketplace — provisioning, scheduling, billing, and orchestration. On-site in SF or LA.
Expand roleSan Francisco or Los Angeles, CAOn-siteFull-time$180K – $300K
Senior Infrastructure Engineer
Design and scale the core systems behind Vast.ai's global GPU marketplace — provisioning, scheduling, billing, and orchestration. On-site in SF or LA.
As a Senior Infrastructure Engineer, you will help design and scale the core systems that power Vast.ai’s global GPU marketplace. You’ll work closely with our founders and core engineering team to extend the underlying compute infrastructure — from GPU provisioning and scheduling to billing, orchestration, and marketplace dynamics. We’re looking for someone who has previously built large-scale infrastructure platforms — systems with similarities to Vast.ai, or distributed compute orchestration frameworks. Tech stack: Python, C++, PostgreSQL, Linux, Docker, KVM, Redis, Terraform, AWS, REST/gRPC APIs.
What You Will Do
- Improve the backend systems that power Vast.ai’s compute marketplace
- Integrate GPU provider onboarding, usage tracking, billing, and orchestration APIs
- Develop scalable infrastructure for workload scheduling and resource management
- Optimize pricing and marketplace logic for efficiency and transparency
- Benchmark, profile, and harden systems for performance, reliability, and fault tolerance
- Collaborate with product and infrastructure teams to shape the future of decentralized compute
What We Need
- Distributed Systems: Experience building high-throughput backend systems or compute clouds
- Compute Orchestration: Familiarity with Docker, or custom scheduling frameworks
- GPU Infrastructure: Understanding of GPU provisioning, driver management, and workload scheduling
- Billing & Metering: Implemented or integrated usage-based billing and account credit systems
- Marketplace Dynamics: Knowledge of dynamic pricing, spot instances, or supply-demand balancing mechanisms
- Security & Multi-Tenancy: Experience designing secure, multi-tenant systems in cloud environments
- Programming: Strong programming skills in Python and C++; ability to write performant, maintainable, well-architected code
- Database Expertise: Comfortable designing schemas and queries for large-scale data systems (PostgreSQL preferred)
Nice To Have
- Experience with GPU security, virtualization, or zero-trust compute isolation
- Prior startup experience or end-to-end product ownership
Benefits
- Comprehensive health, dental, vision, and life insurance
- 401(k) with company match
- Meaningful early-stage equity
- Onsite meals, snacks, and close collaboration with founders and tech leads
- Ambitious, fast-paced startup culture where initiative is rewarded
Application Details
After submitting your application, our technical team reviews your credentials. If selected, you’ll proceed through the following stages: 15 min – Initial screening with member of your future team (virtual); 40 min – Systems and architectures (virtual); 1 hour – LLM-assisted coding assessment (virtual); 2 hours – Meet and greet with coding assessment (on-site). We aim to complete the interview process in about one week.
Apply directly inside Vast.ai.
Research
2 roles
Open each role for the full description, application details, and submission path.
AI Agent Researcher
Help build the next generation of general learning agents — cutting-edge research on memory, reliability, and reasoning. On-site in SF or LA.
Expand roleSan Francisco or Los Angeles, CAOn-siteFull-time$160K – $320K
AI Agent Researcher
Help build the next generation of general learning agents — cutting-edge research on memory, reliability, and reasoning. On-site in SF or LA.
As an Agent Research Engineer, you will help advance the highest levels of the tech stack to imbue AI systems with true agency. Collaborating directly with our technical founder and diverse team, we will build the next generation of general learning agents. Possessing and maintaining a wide, deep and holistic knowledge base of cutting-edge research is crucial for advancing the mission. Many are called, but few are chosen. Tech stack: Python, LLMs, ANNs, C++/Cuda.
What You Will Do
- Lead cutting-edge research in AI agents, focusing on memory, reliability, and reasoning
- Prototype and evaluate novel state-of-the-art methods/models
What We Need
- Deep expertise in systems engineering across the tech stack
- Strong holistic background in machine learning theory and practice
- Diverse knowledge of neural architectures/circuits: transformers etc
- Published research at top AI conferences
Benefits
- Comprehensive health, dental, vision, and life insurance
- 401(k) with company match
- Meaningful early-stage equity
- Onsite meals, snacks, and close collaboration with founders/tech leaders
- Ambitious, fast-paced startup culture where initiative is rewarded
Application Details
After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: Initial screening (virtual 15 minutes); Quick dive into Vast, systems and architectures (virtual 30 minutes); LLM assisted coding assessment (virtual 1 hour); Meet and greet with coding assessment (on-site 2 hours). Our goal is to complete the interview process in two weeks.
Apply directly inside Vast.ai.
Systems/GPU Research Engineer
Develop new kernels, algorithms, and high-performance tensor libraries that push AI model inference forward. On-site in SF or LA.
Expand roleSan Francisco or Los Angeles, CAOn-siteFull-time$160K – $320K
Systems/GPU Research Engineer
Develop new kernels, algorithms, and high-performance tensor libraries that push AI model inference forward. On-site in SF or LA.
As a systems/GPU engineer, you will play a crucial role in developing new kernels and algorithms that can improve inference for AI models. You will help develop new high-performance tensor libraries and auto-optimization tools. Collaborating directly with our technical founder and diverse team, you will enhance the performance and efficiency of our AI systems. Your ability to research and stay on top of cutting-edge papers will be vital in staying up-to-date with the latest advancements in AI model inference and GPU programming techniques. Tech stack: C++/CUDA, GPGPU, Python, Linux.
What You Will Do
- Develop or extend parallel generic GPU libraries and kernels
- Help design and deploy market-based resource management systems
- Quickly investigate and summarize options for new system architectures
- Prototype and evaluate novel state-of-the-art methods/models
- Investigate and learn new frameworks and tools
What We Need
- Expertise in systems engineering across the tech stack
- Deep understanding of GPU architectures
- Strong holistic background in neural network performance and tooling
- Published research at top AI conferences
Benefits
- Comprehensive health, dental, vision, and life insurance
- 401(k) with company match
- Meaningful early-stage equity
- Onsite meals, snacks, and close collaboration with founders/tech leaders
- Ambitious, fast-paced startup culture where initiative is rewarded
Application Details
After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: Initial screening (virtual, 15 minutes); Quick dive into Vast, systems and architectures (virtual, 30 minutes); LLM-assisted coding assessment (virtual, 1 hour); Meet and greet with coding assessment (on-site, 2 hours). Our goal is to complete the interview process in two weeks.
Apply directly inside Vast.ai.