AI models to accurately convert audio files, video files, and live audio streams into text at scale.
AI models to summarize speech, detect hateful content, identify spoken topics, and more.
Introducing LeMUR, our new framework for applying powerful LLMs to transcribed speech
With a single line of code, LeMUR can quickly process transcripts of up to 10 hours of audio content, which translates into roughly 150K tokens, for tasks like summarization and question answering.
We've released our new Conformer-1 model for speech recognition. Conformer-1 was trained on 650K hours of audio data and is our most accurate model to date.
How ChatGPT actually works/Blog
Since its release, the public has been playing with ChatGPT and seeing what it can do, but how does ChatGPT actually work?
Flush with new cash, AssemblyAI looks to grow its AI-as-a-service business/TechCrunch
Light-weight probing of unsupervised representations for Reinforcement Learning/AssemblyAI & NYU
Join our fully remote, global team of researchers and engineers. View all open roles