AssemblyAI
Features
Solutions
Podcast
Accurate transcriptions for accessibility, brand safety, and search
Video
Easily add captions, highlights, and indexing for your videos
Telephony
Industry leading transcription for call tracking, PII redaction, and visual voicemail
Solutions
Podcast
Video
Telephony
Compare
AWS Transcribe
The #1 AWS Transcribe API Alternative
Google Speech-to-Text
The #1 Google Speech API Alternative
Compare
AWS Transcribe
Google Speech-to-Text
Pricing
Docs
About
Blog
Sign Up
Login
Sign Up
Login
#1 Speech-to-Text
API for Developers
Start building now
Contact sales
Industry-leading API, trusted by startups and global enterprises in production
Benchmark
See how AssemblyAI compares to Google, AWS, and other providers on your data
Compare now
Code samples
Designed for Developers
Stop struggling to understand messy APIs and SDKs - AssemblyAI makes it easy for any developer to quickly transcribe their audio and video content
View the API Docs
Features
An API for more than just Speech Recognition
Speaker Diarization
PII Redaction
Automatic Transcription Highlights
99.9%+ Uptime
Model Customization
Entity Detection
Advanced Security and Data Privacy Protocols
See all features
Speaker Diarization
Know who spoke when
PII Redaction
Detect and remove credit card numbers, SSNs, and more
Automatic Transcription Highlights
Surface the most common keywords and phrases
99.9%+ Uptime
Trusted in production to handle millions of transcriptions each day
Model Customization
On-the-fly add keywords and phrases unique to your application or customers
Entity Detection
Detect things like names, injuries, and places spoken in the text
Advanced Security and Data Privacy Protocols
We never store your data, and follow strict security protocols within our systems
Why AssemblyAI
We’re a Deep Learning company, obsessed with moving the State of the Art forward, and giving developers access to great AI tech
Fastest-improving accuracy
Our research team deploys accuracy improvements weekly, and is constantly pushing the State of the Art.
In-house research team
We conduct our own research, and publish leading papers and blogs about the state of Automatic Speech Recognition.
99.9+ uptime
Our systems operate with 99.9%+ uptime and are highly scalable and redundant.
More than Speech-to-Text
Top-rated accuracy along with sentiment analysis, PII redaction, entity recognition, summarization, and more.