Pre-recorded audio
The Speech Recognition model enables you to transcribe spoken words into written text and is the foundation of all AssemblyAI products. On top of the core transcription, you can enable other features and models, such as Speaker Diarization, by adding additional parameters to the same transcription request.
Basic transcription workflows
Batch transcription
Hosting audio files
Speaker Labels
Identifying speakers in audio recordings
Iterate over Speaker Labels with Make.com
Calculate the Talk / Listen Ratio of Speakers
Plot A Speaker Timeline with Matplotlib
Generate Custom Speaker Labels with Pyannote
Use Speaker Diarization with Async Chunking
Setup A Speaker Identification System using Pinecone & Nvidia TitaNet