
Engineering

GPU Systems Engineer – HPC / Parallel Computing

$160K – $320K • Offers Equity • Offers Bonus

San Francisco or Los Angeles • On-site • Full-time

About Us

Vast.ai’s cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity.

We are a growing and highly motivated team dedicated to an ambitious technical plan. Our structure is flat, our ambitions are outsized, and leadership is earned by shipping excellent work.

We seek engineers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of architecture, coding, and communication skills.

LOCATION: On-site at our office in San Francisco or Westwood, Los Angeles.

About the Role

We’re looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You’ll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI.

  • Full-Time

  • On-site at either our SF or LA offices

Tech Stack

CUDA/C++, GPGPU, Python, Linux

Key Responsibilities

  • Design and optimize GPU kernels and tensor libraries

  • Translate HPC techniques into scalable AI inference solutions

  • Evaluate emerging architectures and resource management approaches

  • Collaborate with technical leadership to improve GPU infrastructure efficiency

Ideal Experience

  • Advanced C++ (C++17/20 preferred)

  • Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar)

  • Strong background in systems optimization and HPC performance tooling

  • Familiarity with distributed training/inference frameworks (bonus)

Interview Process

After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages:

  • Initial screening (virtual, 15 minutes)

  • Quick dive into Vast, systems and architectures (virtual, 30 minutes)

  • LLM-assisted coding assessment (virtual, 1 hour)

  • Meet and greet with coding assessment (on-site, 2 hours)

Our goal is to complete the interview process in two weeks.

Benefits

  • Comprehensive health, dental, vision, and life insurance

  • 401(k) with company match

  • Meaningful early-stage equity

  • Onsite meals, snacks, and close collaboration with founders/tech leaders

  • Ambitious, fast-paced startup culture where initiative is rewarded

Why Vast.ai

  • 20,000+ GPUs on the platform

  • 25,000+ monthly customers

  • 8 years of operations data

We're building the infrastructure layer where AI agents and developers programmatically provision and manage GPU compute.

All technical roles report to Jake Cannell, the CEO and founder — a prolific writer and thinker on AI.

  • LOVE in a simbox is all you need

  • The Brain as a Universal Learning Machine

Offices in Los Angeles and San Francisco.

We love to work. We can't help it; we are witnessing the birth of AGI.

Apply for this role

Or email the team directly at jobs@vast.ai