GRAPHS

Features	AssemblyAI Universal-3 Pro Streaming	Deepgram Nova-3	OpenAI GPT-4o Transcribe	Microsoft Azure	ElevenLabs Scribe V2
Entity accuracy (credit card numbers, emails, etc.): identify a wide range of entities spoken in your audio files, such as person and company names, email addresses, dates, and locations.	Industry leading	Low accuracy	Low accuracy	Low accuracy	Low accuracy
Speaker diarization performance	Industry leading	Unreliable	Unreliable	Unreliable
Unlimited concurrency, no rate limits	[placeholder]
Dynamic keyterms prompting (turn-by-turn)	[placeholder]	Static only	[placeholder]	[placeholder]	[placeholder]
Real-time prompting	[placeholder]		[placeholder]	[placeholder]	[placeholder]
Usage-based pricing, no contracts	[placeholder]	Commitments and overages	[placeholder]	Contracts at scale	[placeholder]
LiveKit / Pipecat / Twilio native support	[placeholder]	[placeholder]	Partial	[placeholder]	[placeholder]

Feature
Provider	Transcripts	Perfect	WER Mean	Pooled WER	TTFS Median	TTFS P95	TTFS P99
assemblyai	99.8%	66.8%	3.49%	3.02%	256ms	362ms	417ms
aws	100.0%	77.4%	1.68%	1.75%	1136ms	1527ms	1897ms
azure	100.0%	82.9%	1.21%	1.18%	1016ms	1345ms	1791ms
cartesia	99.9%	60.5%	3.92%	4.36%	266ms	364ms	898ms
deepgram	99.8%	76.5%	1.71%	1.62%	247ms	298ms	326ms
elevenlabs	99.7%	81.3%	3.16%	3.12%	281ms	348ms	407ms
google	100.0%	69.0%	2.84%	2.85%	878ms	1155ms	1570ms
openai	99.3%	75.9%	3.24%	3.06%	637ms	965ms	1655ms
sonix	99.8%	84.1%	1.25%	1.29%	249ms	281ms	310ms
speechmatics	99.7%	83.2%	1.40%	1.07%	495ms	676ms	736ms
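
The benchmark table reports both a mean WER and a pooled WER, plus median, P95, and P99 latency, but it does not say how any of these were computed. The sketch below shows one common way such columns could be derived. It assumes that mean WER averages per-utterance error rates, that pooled WER divides total word errors by total reference words, that "Perfect" is the share of utterances with zero errors, and that percentiles use the nearest-rank method. The function names (`edit_distance`, `wer_summary`, `percentile`) are hypothetical, and text normalization before scoring is left out.

```python
import math
from statistics import mean


def edit_distance(ref: list[str], hyp: list[str]) -> int:
    """Word-level Levenshtein distance between a reference and a hypothesis."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,           # deletion (reference word missing)
                            curr[j - 1] + 1,       # insertion (extra hypothesis word)
                            prev[j - 1] + cost))   # substitution or exact match
        prev = curr
    return prev[-1]


def wer_summary(pairs: list[tuple[str, str]]) -> dict[str, float]:
    """Summarize (reference, hypothesis) pairs; every reference must be non-empty."""
    errors, lengths = [], []
    for ref_text, hyp_text in pairs:
        ref, hyp = ref_text.split(), hyp_text.split()
        errors.append(edit_distance(ref, hyp))
        lengths.append(len(ref))
    per_utterance = [e / n for e, n in zip(errors, lengths)]
    return {
        "mean_wer": mean(per_utterance),               # every utterance weighs the same
        "pooled_wer": sum(errors) / sum(lengths),      # every word weighs the same
        "perfect": sum(e == 0 for e in errors) / len(errors),
    }


def percentile(values: list[float], p: float) -> float:
    """Nearest-rank percentile (assumed method; the table does not name one)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


if __name__ == "__main__":
    # Illustrative data only, not taken from the benchmark.
    pairs = [
        ("turn left", "turn right"),
        ("please call me back after lunch today thanks",
         "please call me back after lunch today thanks"),
    ]
    print(wer_summary(pairs))
    print(percentile([250, 260, 300, 410, 900], 95))
```

Under these assumptions, one error in a two-word utterance raises mean WER much more than pooled WER. That is one possible reason the two columns diverge for some providers (for example speechmatics at 1.40% mean versus 1.07% pooled).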