Jobs

Our mission is to organize, optimize, and orient the world's computation.

Join the team building the future of AI.

What We're Building

Vast.ai is the AI compute platform — 20,000+ GPUs, 25,000+ monthly customers, and 8 years of operations data. We're building the infrastructure layer where AI agents and developers programmatically provision and manage GPU compute.

We are also developing software to accelerate the training and deployment of complex neural networks on our decentralized infrastructure.

Our Locations

We have offices in Los Angeles and San Francisco.

Vast.ai Los Angeles

1100 Glendon Ave #1840
Los Angeles, CA 90024

Vast.ai San Francisco

100 First Street, #2250
San Francisco, CA 94105

The Work

The journey to our destiny will not be easy. Our goal is not the safe harbor of today. You will be taxed, and you will be pushed. We love to work. We can't help it; we are witnessing the birth of AGI.

All technical roles report to Jake Cannell, the CEO and founder. Jake is a prolific writer and thinker on the subject of AI. Two examples from Less Wrong:

LOVE in a simbox is all you need The Brain as a Universal Learning Machine

Open Roles

Current openings at Vast.ai

Every opening listed here is managed directly by Vast.ai.

Engineering

5 roles

Open each role for the full description, application details, and submission path.

C++ Software Engineer — Systems

Own performance-critical systems like the daemon orchestrating every host in our fleet — performance, reliability, containerization, and telemetry. On-site in SF or LA.

Expand role

San Francisco or Los Angeles, CAOn-siteFull-time$120K – $180K

As a Systems Software Engineer, you will work on key performance critical systems such as the daemon which orchestrates every host in our fleet. You will improve the performance, reliability and capability of our infrastructure and containerization technologies including monitoring and telemetry. Tech stack: C, C++, Python, Linux.

What You Will Do

Expand and extend our GPU cloud daemon
Design and deploy market-based resource management systems
Harden code and infrastructure to meet zero-trust standards
Benchmark, profile, and eliminate bottlenecks across hypervisor, container, and network layers

What We Need

Programming: Strong programming skills in at least one language, ideally C++
Linux and Virtualization: Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization
Isolation Techniques: Deep understanding of workload and network isolation techniques in multi-tenant environments
Cloud Security: Experience in securing and hardening cloud infrastructure, particularly in environments with untrusted workloads
Multi-tenant Security: Strong background in workload and network isolation, network security, and cloud-native security practices
GPU Security: Experience with GPU programming and an understanding of GPU-specific security concerns.

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Application Details

After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: 15 min - Initial screening (virtual); 45 min - Quick dive into Vast, systems and architectures (virtual); 1 hour - LLM-assisted coding assessment (virtual); 2 hours - Meet and greet with coding assessment (on-site). We aim to complete the interview process in about one week.

Apply now

Apply directly inside Vast.ai.

GPU Systems Engineer – HPC / Parallel Computing

Bring HPC and parallel-programming expertise to AI inference — design and optimize GPU kernels and tensor libraries at the bleeding edge. On-site in SF or LA.

Expand role

San Francisco or Los Angeles, CAOn-siteFull-time$160K – $320K

We’re looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You’ll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI. Tech stack: CUDA/C++, GPGPU, Python, Linux.

What You Will Do

Design and optimize GPU kernels and tensor libraries
Translate HPC techniques into scalable AI inference solutions
Evaluate emerging architectures and resource management approaches
Collaborate with technical leadership to improve GPU infrastructure efficiency

What We Need

Advanced C++ (C++17/20 preferred)
Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar)
Strong background in systems optimization and HPC performance tooling

Nice To Have

Familiarity with distributed training/inference frameworks

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Application Details

After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: Initial screening (virtual, 15 minutes); Quick dive into Vast, systems and architectures (virtual, 30 minutes); LLM-assisted coding assessment (virtual, 1 hour); Meet and greet with coding assessment (on-site, 2 hours). Our goal is to complete the interview process in two weeks.

Apply now

Apply directly inside Vast.ai.

QA Associate

Manual and automated QA for the web apps and backend services behind Vast.ai's GPU marketplace. Onsite 5 days a week in Westwood, Los Angeles.

Expand role

Los Angeles, CAOn-siteFull-time$40/Hr

We are seeking a highly skilled QA Associate to do manual and automated testing of web apps and backend services in Vast's Linux-first environment. This role is critical to ensure that our complex, always-on, high-traffic systems are reliable and performant. The ideal candidate is both highly technical and sensitive to the detailed needs of our users. This role is onsite 5 days a week in our Westwood, Los Angeles office.

What You Will Do

Execute manual and exploratory testing for web apps + backend services
Maintain existing manual test plans and write new plans for features being developed
Design high-signal test cases and automation
Test and validate software to ensure that it satisfies requirements and is defect free
Analyze the root cause for testing failures and open appropriate tickets with sufficient findings
Collaborate with the Product and Development teams to define acceptance criteria and ship reliable releases

What We Need

3+ years hands-on testing of web applications and APIs
Strong knowledge of test methodologies and their corresponding tools
Experience with writing test plans and test cases for assigned features
Experience with test automation and lightweight scripting/coding
Keen eye for detail
Proficient with Linux

Nice To Have

Passionate about the future of AI
API testing with Postman/Newman or similar
Containers and orchestration basics (Docker; Kubernetes concepts)
Experience with load testing tools
Familiarity with GPUs and GPU drivers—very nice to have, but not required

Benefits

Work 5 days a week from the Vast.ai HQ in Westwood, Los Angeles in an ambitious, fast-paced, AI-centered startup environment
Health, dental, vision and life insurance coverage
Matching 401K

Apply now

Apply directly inside Vast.ai.

Security Engineer

Offensive and defensive security for a global GPU cloud — secure architecture, assessments, tooling, and compliance (SOC 2, ISO 27001). Onsite in Westwood, LA.

Expand role

Los Angeles, CAOn-siteFull-time$145K – $185K

We are seeking a skilled Security Engineer to join our dynamic team. We hire people with broad skill sets who also exhibit deep expertise. The ideal candidate will have experience in both offensive and defensive security, strong software development skills, and deep knowledge of Linux systems and containerization. This role provides the opportunity to work on cutting-edge GPU cloud technologies, tackle complex security challenges at scale, and directly enhance the resilience and trustworthiness of our infrastructure and services.

What You Will Do

Collaborate with Operations Team: Partner with our operations team to ensure compliance with relevant standards such as SOC 2, ISO 27001, and GDPR
Secure Architecture Design: Develop and implement secure architectures for our GPU cloud platform
Security Assessments: Conduct security assessments, threat modeling, code reviews, and penetration testing
Security Improvements: Develop and implement security fixes and improvements in collaboration with engineering teams
Security Tools Management: Implement and manage security tools and systems, including SIEM, WAF, and EDR
Documentation: Create and maintain security documentation, including policies, procedures, and technical guidelines
Security Training: Provide security guidance and training to engineering teams to foster a security-first culture
Incident Response: Participate in incident response activities and contribute to post-incident analysis and improvements

What We Need

A problem-solver who thrives in a fast-paced environment
Committed to continuous improvement and staying updated with the latest security practices and cloud technologies
A team player with strong communication skills, able to bridge the gap between development and security
Educational: Bachelor's degree in Computer Science, Cybersecurity, or a related field.
Programming: Strong programming skills in at least one language, ideally Python or C.
Linux and Virtualization: Extensive knowledge of Linux kernel internals, containerization technologies, and virtualization
Isolation Techniques: Deep understanding of workload and network isolation techniques in multi-tenant environments
Cloud Security: Experience in securing and hardening cloud infrastructure, particularly in environments with untrusted workloads
Network and Application Security: Strong background in network security, application security, and cloud-native security practices
Security Testing Tools: Experience with security testing tools and methodologies, such as OWASP, Burp Suite, and static/dynamic analysis tools
Cybersecurity Frameworks: Familiarity with common cybersecurity frameworks, including SOC 2, NIST, and CIS Controls

Nice To Have

Security Certifications: Relevant security certifications such as CISSP, CCSP, or OSCP.
DevSecOps Experience: Experience with DevSecOps practices and tools in cloud environments.
Regulatory Compliance: Familiarity with regulatory compliance requirements for operating cloud services.
GPU Security: Experience with GPU programming and an understanding of GPU-specific security concerns.

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Apply now

Apply directly inside Vast.ai.

Senior Infrastructure Engineer

Design and scale the core systems behind Vast.ai's global GPU marketplace — provisioning, scheduling, billing, and orchestration. On-site in SF or LA.

Expand role

San Francisco or Los Angeles, CAOn-siteFull-time$180K – $300K

As a Senior Infrastructure Engineer, you will help design and scale the core systems that power Vast.ai’s global GPU marketplace. You’ll work closely with our founders and core engineering team to extend the underlying compute infrastructure — from GPU provisioning and scheduling to billing, orchestration, and marketplace dynamics. We’re looking for someone who has previously built large-scale infrastructure platforms — systems with similarities to Vast.ai, or distributed compute orchestration frameworks. Tech stack: Python, C++, PostgreSQL, Linux, Docker, KVM, Redis, Terraform, AWS, REST/gRPC APIs.

What You Will Do

Improve the backend systems that power Vast.ai’s compute marketplace
Integrate GPU provider onboarding, usage tracking, billing, and orchestration APIs
Develop scalable infrastructure for workload scheduling and resource management
Optimize pricing and marketplace logic for efficiency and transparency
Benchmark, profile, and harden systems for performance, reliability, and fault tolerance
Collaborate with product and infrastructure teams to shape the future of decentralized compute

What We Need

Distributed Systems: Experience building high-throughput backend systems or compute clouds
Compute Orchestration: Familiarity with Docker, or custom scheduling frameworks
GPU Infrastructure: Understanding of GPU provisioning, driver management, and workload scheduling
Billing & Metering: Implemented or integrated usage-based billing and account credit systems
Marketplace Dynamics: Knowledge of dynamic pricing, spot instances, or supply-demand balancing mechanisms
Security & Multi-Tenancy: Experience designing secure, multi-tenant systems in cloud environments
Programming: Strong programming skills in Python and C++; ability to write performant, maintainable, well-architected code
Database Expertise: Comfortable designing schemas and queries for large-scale data systems (PostgreSQL preferred)

Nice To Have

Experience with GPU security, virtualization, or zero-trust compute isolation
Prior startup experience or end-to-end product ownership

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders and tech leads
Ambitious, fast-paced startup culture where initiative is rewarded

Application Details

After submitting your application, our technical team reviews your credentials. If selected, you’ll proceed through the following stages: 15 min – Initial screening with member of your future team (virtual); 40 min – Systems and architectures (virtual); 1 hour – LLM-assisted coding assessment (virtual); 2 hours – Meet and greet with coding assessment (on-site). We aim to complete the interview process in about one week.

Apply now

Apply directly inside Vast.ai.

Research

2 roles

Open each role for the full description, application details, and submission path.

AI Agent Researcher

Help build the next generation of general learning agents — cutting-edge research on memory, reliability, and reasoning. On-site in SF or LA.

Expand role

San Francisco or Los Angeles, CAOn-siteFull-time$160K – $320K

As an Agent Research Engineer, you will help advance the highest levels of the tech stack to imbue AI systems with true agency. Collaborating directly with our technical founder and diverse team, we will build the next generation of general learning agents. Possessing and maintaining a wide, deep and holistic knowledge base of cutting-edge research is crucial for advancing the mission. Many are called, but few are chosen. Tech stack: Python, LLMs, ANNs, C++/Cuda.

What You Will Do

Lead cutting-edge research in AI agents, focusing on memory, reliability, and reasoning
Prototype and evaluate novel state-of-the-art methods/models

What We Need

Deep expertise in systems engineering across the tech stack
Strong holistic background in machine learning theory and practice
Diverse knowledge of neural architectures/circuits: transformers etc
Published research at top AI conferences

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Application Details

After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages: Initial screening (virtual 15 minutes); Quick dive into Vast, systems and architectures (virtual 30 minutes); LLM assisted coding assessment (virtual 1 hour); Meet and greet with coding assessment (on-site 2 hours). Our goal is to complete the interview process in two weeks.

Apply now

Apply directly inside Vast.ai.

Systems/GPU Research Engineer

Develop new kernels, algorithms, and high-performance tensor libraries that push AI model inference forward. On-site in SF or LA.

Expand role

San Francisco or Los Angeles, CAOn-siteFull-time$160K – $320K

As a systems/GPU engineer, you will play a crucial role in developing new kernels and algorithms that can improve inference for AI models. You will help develop new high-performance tensor libraries and auto-optimization tools. Collaborating directly with our technical founder and diverse team, you will enhance the performance and efficiency of our AI systems. Your ability to research and stay on top of cutting-edge papers will be vital in staying up-to-date with the latest advancements in AI model inference and GPU programming techniques. Tech stack: C++/CUDA, GPGPU, Python, Linux.

What You Will Do

Develop or extend parallel generic GPU libraries and kernels
Help design and deploy market-based resource management systems
Quickly investigate and summarize options for new system architectures
Prototype and evaluate novel state-of-the-art methods/models
Investigate and learn new frameworks and tools

What We Need

Expertise in systems engineering across the tech stack
Deep understanding of GPU architectures
Strong holistic background in neural network performance and tooling
Published research at top AI conferences

Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Application Details

Apply now

Apply directly inside Vast.ai.