PAICON

PAICON Accelerates Global, Data-Centric Cancer Diagnostics with Vast.ai

How a global oncology data platform used Vast.ai’s GPU marketplace to rapidly iterate on Athena—validating that diversity can matter more than scale—while significantly reducing research-phase training costs.

PAICON Accelerates Global, Data-Centric Cancer Diagnostics with Vast.ai

60%+

Research-Phase Training Cost Reduction

Agility

Parallel experiments at research scale

Speed

Faster iteration and feedback cycles

Hybrid

Vast.ai for R&D, hyperscalers for production

Overview

Industry: Medical AI
Key area: Data-centric oncology and multimodal clinical data

PAICON is a Germany-based, data-centric oncology company building the foundations for truly generalizable medical AI. Its core advantage is access to globally distributed, real-world clinical datasets through long-term collaborations with hospitals, labs and research institutions -- spanning geographies, patient populations and technical imaging variability.
PAICON unifies and quality-controls multimodal medical data, starting with oncology, to enable partners to develop -- and regulators to trust -- AI systems that work beyond a single region, scanner or protocol. To demonstrate what this approach makes possible, PAICON built Athena, a histopathology foundation model designed to learn from diversity rather than sheer volume -- supporting the principle that variety can outperform scale when the goal is real-world generalization.
PAICON supports and actively engages in the development of clinical-grade AI by providing curated, governance-ready datasets and the infrastructure needed to train and validate models across heterogeneous real-world conditions -- helping reduce time-to-research and improving external validity across sites.
Led by Dr. Manasi Aichmüller-Ratnaparkhe (CEO & Co-founder), Danny Quick (COO) and Dr. Christian Aichmüller (CTO & Co-founder), PAICON works with a growing network of global collaborators. This gives PAICON access to one of the most geographically and technically diverse pathology datasets -- capturing differences in staining, scanner hardware, lab workflows and population-level variation that often cause medical AI systems to fail when deployed outside the environments they were trained on.

The Challenge: Enabling Research-Scale Iteration

Building generalizable medical AI is both a data and compute challenge. Robust models require large multimodal datasets, strict governance requirements, and extensive testing across sources of variation.
The training strategy behind Athena required extensive experimentation -- not just a single large training run -- because robustness emerges from iterating across diverse subsets, sampling strategies, architectures, augmentation approaches and evaluation setups.
These workloads required substantial GPU compute to support:
  • Training and adapting vision models across multi-site, multi-region datasets
  • Running many parallel experiments
  • Validating generalization across technical and population diversity
Hyperscale cloud platforms such as AWS are excellent for production deployments --offering reliability, security tooling and global availability. However, Athena’s R&D phase required a high volume of experiments at the frontier of cost-efficiency. In that environment, premium GPU instance pricing and periodic capacity constraints made sustained, large-scale iteration economically challenging.
For production workloads, hyperscalers remain a strong fit. For research-scale experimentation, PAICON required a way to iterate rapidly without costs becoming prohibitive.
Medical research also demands flexibility around where compute runs and how workflows are controlled -- particularly when operating across regions with varying governance requirements.

The Solution: Hybrid Compute with Vast.ai

PAICON used Vast.ai’s decentralized GPU marketplace to access large, multi-GPU configurations with pricing that made research-scale iteration feasible. This enabled the team to run more experiments, explore robustness strategies and accelerate development of Athena -- while retaining hyperscalers for production deployments when required.
The hybrid workflow included:
  • Curating and pre-processing datasets in existing environments before exporting training-ready subsets
  • Training and fine-tuning on multi-GPU Vast.ai machines for high-throughput experimentation
  • Using on-demand clusters to run parallel trials and rapidly iterate on model variants
Vast.ai’s flexible supply model complemented PAICON’s multinational collaboration footprint, supporting dynamic research workloads without locking experimentation into rigid infrastructure models.

“With Vast.ai, we can scale experiments up and down quickly -- moving from 4 to 8 GPUs when needed -- while keeping iteration economically sustainable.”

See How Vast.ai can Transform AI for You

Discover how Vast.ai powers AI innovation. Our team will guide you through a tailored platform walkthrough and show how we combine enterprise-level capabilities with startup-friendly pricing.

Results

Significant Training Cost Reduction by 60%+

By shifting large training workloads to Vast.ai, PAICON substantially reduced research-phase GPU training costs compared with comparable hyperscale configurations.

Greater Experimental Agility

On-demand access to large GPU instances enabled parallel experimentation and faster evaluation of robustness strategies across diverse data slices.

Faster Research Cycles

Multi-GPU training runs across heterogeneous datasets completed reliably, allowing faster feedback loops. Early Athena experiments showed encouraging cross-tissue signals, supporting PAICON’s thesis that diversity-aware training improves generalization.

Beyond Foundation Models: Knowledge Systems for Diagnostics

In parallel with vision research, PAICON explored how curated pathology knowledge corpora can support education and decision support. Vast.ai’s GPU capacity enabled prototyping and fine-tuning of domain-adapted language models within PAICON’s broader data-centric roadmap.

Why PAICON Stands Out

PAICON’s differentiator is data diversity with governance: aggregating real-world clinical data across regions, labs and technical pipelines to enable AI systems that hold up outside controlled environments.
By representing population-level and laboratory-level variability -- rather than optimizing only for size -- PAICON helps partners build models that are more robust, less biased and more transferable across sites.
Athena serves as a proof point of this thesis: a foundation model built to learn from heterogeneity, demonstrating why diversity-aware training is essential for clinical deployment – enabled by and benefiting from Vast.ai’s globally distributed compute infrastructure.

Looking Ahead

PAICON will continue expanding its global dataset and collaboration network, extending Athena to additional disease areas and modalities over time. Vast.ai will remain an important component of PAICON’s R&D compute stack for rapid experimentation.

About PAICON

PAICON is a Germany-based, data-centric medical diagnostics company enabling and participating in the development of generalizable AI through globally sourced clinical datasets and partner-driven validation.
Focus areas include:
  • A global, governance-ready oncology data platform
  • Data curation, harmonization and QC pipelines across heterogeneous pathology sources
  • Foundation model research to improve transfer across tissue types and sites
  • Collaboration enablement: datasets and evaluation frameworks for partners building clinical AI
Mission: Enabling precision oncology by making globally representative clinical data accessible for trustworthy AI development.

Start Building AI Without Limits

Join thousands of developers and enterprises running AI workloads on Vast.

Get Started
Creatix Technology

Creatix Technology

Creatix Technology Scales to 200K Daily Users with Vast.ai's GPU Cloud

How a fast-growing AI app company cut infrastructure costs by over 60% and powered millions of new users with Vast.ai.

Tech