The number of speakers in the audio is detected automatically, and each transcribed word is associated with its speaker. This helps answer the question "Who spoke when?"
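As a rough illustration of what word-level speaker association makes possible, the sketch below collapses consecutive words from the same speaker into readable turns. The word structure (a dict with `text` and `speaker` keys) and the function name `speaker_turns` are assumptions made for this example, not a shape confirmed by the text above.

```python
from itertools import groupby


def speaker_turns(words):
    """Collapse consecutive words from the same speaker into one turn.

    `words` is a list of dicts with hypothetical "text" and "speaker" keys,
    in the order the words were spoken.
    """
    turns = []
    # groupby only merges adjacent items, so a speaker who talks again
    # later gets a new turn rather than being folded into an earlier one.
    for speaker, group in groupby(words, key=lambda w: w["speaker"]):
        turns.append((speaker, " ".join(w["text"] for w in group)))
    return turns


# Illustrative data only; not real API output.
words = [
    {"text": "Hi", "speaker": "A"},
    {"text": "there.", "speaker": "A"},
    {"text": "Hello!", "speaker": "B"},
    {"text": "How", "speaker": "A"},
    {"text": "are", "speaker": "A"},
    {"text": "you?", "speaker": "A"},
]

for speaker, text in speaker_turns(words):
    print(f"Speaker {speaker}: {text}")
```

Because only adjacent words are merged, the result reads as a dialogue in time order, which is the "who spoke when" view.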
Don't worry about file formats or sampling rates. Our API supports virtually all audio and video files, with no transcoding required.
Punctuation and the casing of proper nouns are automatically added to the transcription text, making transcripts produced by the API more readable.
Our API is built on the latest advances in deep learning, drawing on cutting-edge research from our in-house team of AI researchers.
Automatically determine the topics discussed in your audio or video files. This feature uses the IAB Taxonomy to predict over 698 different topic labels, such as "Automotive > Self Driving Cars".
Automatically detect sensitive content in your transcriptions, such as drugs, weapons, NSFW material, and over 20 other categories.
Automatically detect and replace sensitive data, like credit card numbers and social security numbers, in the transcription text and source audio.
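Features like topic detection, sensitive-content detection, and redaction are usually switched on per request. The sketch below builds a JSON request body with those options enabled. The option names (`speaker_labels`, `iab_categories`, `content_safety`, `redact_pii`), the `audio_url` field, and the helper `build_request_body` are assumptions for illustration and are not stated in the text above, so check them against the API reference before relying on them. No network call is made.

```python
import json

# Assumed mapping from feature to request option name.
# These names are not taken from the text above; verify before use.
FEATURE_FLAGS = {
    "speaker_labels": "speaker_labels",
    "topic_detection": "iab_categories",
    "content_safety": "content_safety",
    "pii_redaction": "redact_pii",
}


def build_request_body(audio_url, features):
    """Return a request body dict with the chosen features enabled."""
    unknown = set(features) - set(FEATURE_FLAGS)
    if unknown:
        raise ValueError(f"unknown features: {sorted(unknown)}")
    body = {"audio_url": audio_url}
    for name in features:
        body[FEATURE_FLAGS[name]] = True
    return body


# Placeholder URL; substitute a file you are allowed to transcribe.
body = build_request_body(
    "https://example.com/meeting.mp3",
    ["topic_detection", "content_safety", "pii_redaction"],
)
print(json.dumps(body, indent=2))
```

Keeping the feature-to-option mapping in one table means a renamed option only has to be corrected in one place.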
Thousands of developers and startups trust AssemblyAI to power core features in production, with over 99.9% uptime and transcript completion rate.
We're here to work with you as much, or as little, as you'd like. Talk to us over live chat, Slack, email, or phone.
We adhere to best-practice guidelines like encryption in transit and at rest.