customers
All customer stories
Top Voice AI companies are building with Assembly.
resources
Latest Release
Voice Agent API
Voice agents that get it right, respond instantly, and ship the same day with our new Voice Agent API
resources
Stop maintaining Whisper infrastructure. Get better accuracy and a full suite of features with a managed API:
Your transcriptions will show here...
Transcribe over 99+ languages and counting, including Global English (English and all of its accents).
Detect the number of speakers in your audio file, with each word in the text associated with its speaker.
Automatically detect languages and route to the appropriate model for transcription.
Connect with multiple LLM providers including Claude, GPT, Gemini, and more.
Need more than transcription? AssemblyAI's Voice Agent API lets you build full voice pipelines — STT, LLM, TTS — without stitching together separate services.
Ultra-fast and ultra-accurate real-time speech-to-text, unlimited concurrency, and usage-based pricing.
Use prompt engineering to control transcription style and improve accuracy for domain-specific terminology.
Translate transcripts into over 100 languages with a single API request.
Get your free API key and ship your first transcript in minutes — no infrastructure to maintain.
AssemblyAI's managed API endpoint and diarization won me over—something Whisper couldn't provide.
Josh Mohrer, Founder at Wave.co
Test our best-in-class speech-to-text and voice agent models in our no-code playground.