Features
AssemblyAI
Universal-3 Pro Streaming
Deepgram
Nova-3
OpenAI
GPT-4o Transcribe
Microsoft
Azure
ElevenLabs
Scribe V2
Entity accuracy
(Credit card numbers, emails, etc.)

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Industry leading
Low
accuracy
Low
accuracy
Low
accuracy
Low
accuracy
Speaker diarization performance

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Industry Leading
Unreliable
Unreliable
Unreliable
Unlimited concurrency, no rate limits

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Dynamic keyterms prompting
(turn-by-turn)

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Static only

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Real-time prompting

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Usage-based pricing, no contracts

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Commitments
and overages

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Contracts at scale

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

LiveKit / Pipecat / Twilio
native support

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Partial

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Identify a wide range of entities that are spoken in your audio files, such as person and company names, email addresses, dates, and locations.

Feature

Dataset

Transcripts

Perfect

WER Mean

Pooled WER

TTFS Median

TTFS P95

TTFS P99

assemblyai

99.8%

66.8%

3.49%

3.02%

256ms

362ms

417ms

aws

100.0%

77.4%

1.68%

1.75%

1136ms

1527ms

1897ms

azure

100.0%

82.9%

1.21%

1.18%

1016ms

1345ms

1791ms

cartesia

99.9%

60.5%

3.92%

4.36%

266ms

364ms

898ms

deepgram

99.8%

76.5%

1.71%

1.62%

247ms

298ms

326ms

elevenlabs

99.7%

81.3%

3.16%

3.12%

281ms

348ms

407ms

google

100.0%

69.0%

2.84%

2.85%

878ms

1155ms

1570ms

openai

99.3%

75.9%

3.24%

3.06%

637ms

965ms

1655ms

sonix

99.8%

84.1%

1.25%

1.29%

249ms

281ms

310ms

speechmatics

99.7%

83.2%

1.40%

1.07%

495ms

676ms

736ms