AssemblyAI Blog

Building an end-to-end Speech Recognition model in PyTorch

Deep learning has changed the game in speech recognition with the introduction of end-to-end models. These models take in audio and directly output transcriptions. Two of the most popular end-to-end models today are Deep Speech by Baidu and Listen, Attend and Spell (LAS) by Google. Both Deep Speech ...
