Real-time speech-to-text with speaker diarization

Voxtral Transcribe 2 delivers ultra-fast, highly accurate speech-to-text with real-time transcription and speaker diarization. Built for live apps, voice agents, and meetings, it supports 13 languages, word-level timestamps, and privacy-first deployment All at industry-leading speed and cost.