Featured Resources
Introducing Conformer-1announcements
We've released our new Conformer-1 model for speech recognition. Conformer-1 was trained on 650K hours of audio data and is our most accurate model to date.
How ChatGPT actually worksblog
Since its release, the public has been playing with ChatGPT and seeing what it can do, but how does ChatGPT actually work?
Stable Diffusion 1 vs 2 - What you need to knowblog
Learn where the differences between the two models stem from and what they mean in practice in this simple guide.
Hundreds of businesses, including dozens of Fortune 500s, process millions of audio files every day with AssemblyAI's API. We give innovative businesses the tools they need to quickly ship exciting new products and applications built on top of audio data.
Trusted by companies of all sizes — from startups to Fortune 500
Our Products
Automatically convert audio and video into text. Go beyond Speech-to-Text by turning unstructured data into structured data with advanced summarization, topic detection, tagging, content moderation, and other Audio Intelligence APIs - all with a single network request.
Async transcription
Real-time transcription
Speaker labels
International languages
A Secure Solution
We are proudly SOC 2 (Type 1 and 2) certified and GDPR-compliant. Leading investors and Fortune 500 companies trust AssemblyAI in production every day to securely process millions of sensitive audio files.
French, German, and Italian transcriptions are now publicly available. We have also released v2 of our Spanish model, improving absolute accuracy by 4%.
Ready for Scale
We work with dozens of Fortune 500s to process millions of audio and video files every day. Product teams at large, innovative businesses rely on AssemblyAI to quickly ship exciting new products and features built on top of audio data.