Out-of-the-box features for any Speech-to-Text application, at any scale.
Out-of-the-box ready features built for any project that requires Speech-to-Text.
Our API is built using the latest advances in Deep Learning, using cutting edge research from our in-house team of AI researchers.
Casing and punctuation of proper nouns are automatically added to the transcription text, to make transcripts produced by the API more readable.
Word-by-word timestamps across the entire transcript text.
Get a confidence score for each word in the transcript.
Don't worry about file formats or sampling rates, our API supports virtually all audio/video files without any transcoding required.
The number of speakers in the audio can be automatically detected, and each word transcribed associated with its speaker.
Boost accuracy for your specific use case with these simple customization options.
Phone calls recorded in stereo/dual channel are transcribed separately, and you'll get a transcript for each channel.
Boost accuracy for key terms and phrases like person or product names, places, and other vocabulary unique to your application.
Select a customized model for Australian, UK, South African, and more dialects to boost accuracy on your data.
Get more of out your transcripts with these easy-to-use features.
The API can automatically detect key phrases and words in your transcription text - very useful for tagging content, and providing a summary of the transcription text.
Easily export your transcription in SRT or VTT format, to be plugged into a video player for subtitles and closed captions.
Automatically detect and replace sensitive data, like credit card numbers and social security numbers, in the transcription text and source audio.
Automatically detect and replace profanity in the transcription text.
Best-practice security standards to keep your data safe, secure, and private.
We adhere to best-practice guidelines like encryption in transit and at rest.
We are not in the business of monetizing your data. Files sent to the API for transcription are never stored, and you can request the deletion of transcription text permanently from our database.
Thousands of developers and startups trust AssemblyAI to power core features in production, with over 99.9% uptime and transcript completion rate.
We're here to work with you as much, or as little, as you'd like. Talk to us over live chat, Slack, email, or phone.