Streaming Speech-to-Text

Real-time transcription your notetaker, agents, and captions can depend on

Universal-3 Pro Streaming

Your transcriptions will show here...

Delphi
Happy Scribe
Glean
Granola
Supernormal
Runway
Ashby
Jiminny
JotPsych
Earmark
EdgeTier
Genio
Grain
Loop
Calabrio
Veed.io
Dovetail
WhatConverts
CallRail
Delphi
Happy Scribe
Glean
Granola
Supernormal
Runway
Ashby
Jiminny
JotPsych
Earmark
EdgeTier
Genio
Grain
Loop
Calabrio
Veed.io
Dovetail
WhatConverts
CallRail
Delphi
Happy Scribe
Glean
Granola
Supernormal
Runway
Ashby
Jiminny
JotPsych
Earmark
EdgeTier
Genio
Grain
Loop
Calabrio
Veed.io
Dovetail
WhatConverts
CallRail
Delphi
Happy Scribe
Glean
Granola
Supernormal
Runway
Ashby
Jiminny
JotPsych
Earmark
EdgeTier
Genio
Grain
Loop
Calabrio
Veed.io
Dovetail
WhatConverts
CallRail
Models

Pick the model that fits your workload

Real-time transcription fast enough for voice agents, accurate enough for production.

Compare features

Model
U-3 Pro Streaming Voice agents, AI scribes
Universal Streaming AI notetakers, call centers
Univ. Multilingual Global contact centers
Whisper Streaming Long-tail language support
Price
$0.45 /hr
$0.15 /hr
$0.15 /hr
$0.30 /hr
Languages
EN, ES, FR, DE, IT, PT
English
EN, ES, FR, DE, IT, PT
99+ languages
Natural language prompting
Up to ~1,500 words
Keyterm prompting
Up to ~100 words
Up to ~100 words
Up to ~100 words
Code-switching
Speaker diarization
10+ speakers
10+ speakers
10+ speakers
Medical terminology
Medical mode add-on
Medical mode add-on
Medical mode add-on
HIPAA BAA
On request
On request
On request
On request
Unlimited concurrency
Use cases

Built for every voice workflow

Real-time transcription powers every application where you stream audio.

Common questions