Audio-to-Text Transcription
Rapidly convert audio to accurate text with GPU-powered transcription.
Built for This
- Convert audio files to accurate transcripts using open-source models.
- Handle large-scale transcription workloads with scalable GPU access.
- Transcribe multiple languages and common audio formats inside a locked-down container.
- Launch ready-to-use speech-to-text environments in one click or via CLI.
Models
audio
ACE Step V1 3.5B
ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holistic architectural design.
audio
Dia 1.6B
Dia generates highly realistic dialogue directly from a transcript. You can condition the output on audio to control emotion and tone.