customers
All customer stories
Top Voice AI companies are building with Assembly.
resources
Latest Release
Universal 3.5 Pro Realtime
The first streaming speech-to-text model that takes the agent's question as input.
resources
Video visits add variable audio, headsets, and network jitter on top of clinical vocabulary. Universal-3 Pro with Medical Mode holds a 3.2% Missed Entity Rate on that audio and runs in real time during the call — so your scribe, captions, or triage layer keeps up. One parameter, on async or realtime.
Missed Entity Rate on clinical conversations with Medical Mode.
Fewer missed medical entities than Universal-3 Pro alone.
Fewer medical entity errors than the base model.
Lowest Missed Entity Rate vs. Deepgram, Speechmatics, AWS, and Google.
Stream the call's audio into one WebSocket and get medical-grade transcripts back in real time — no model swap, no re-integration.
from assemblyai.streaming.v3 import StreamingClient, StreamingParameters
client = StreamingClient(api_key="YOUR_API_KEY")
# Telehealth: real-time transcription during the video visit
client.connect(StreamingParameters(
speech_model="universal-3-5-pro",
domain="medical-v1",
keyterms_prompt=["telehealth", "hypertension", "lisinopril", "follow-up"],
)) ~150ms P50 on Universal-3.5 Pro Realtime for live captions and in-visit documentation.
Hallucination rate ~30% lower than Whisper; holds accuracy through accents, headsets, and compressed call audio.
English, Spanish, German, French with native code-switching.