August 1, 2025

Streaming STT Performance Update | August 1, 2025 Newsletter

AssemblyAI's new speaker diarization model delivers 30% better accuracy in noisy audio. Plus: Build 465ms voice agents & learn how Dovetail improved WER by 36%.

Devon Malloy

Staff Growth Manager

Streaming Speech-to-Text

Reviewed by

Table of contents

[Visible on live site]

Streaming Speech-to-Text Improvements: What's New

AssemblyAI has released significant performance improvements for our Streaming Speech-to-Text API, delivering substantial error rate reductions in critical transcription areas. These updates are now live for all users.

Key Performance Improvements for Streaming STT

Our latest streaming improvements have achieved significant performance gains on clean data, with a focus on accuracy for repeated digits and tokens.

Repeating Digits and Tokens Enhancement

What We Fixed:

Missing repeated digits in transcription output
Repetitive token recognition issues (e.g., "yes" "yes")

Performance Metrics:

Previous error rate: 28.20%
New error rate: 13.47%
Total improvement: 52% reduction in error rate

Real-World Impact: This enhancement provides better handling of:

Phone numbers
Confirmation codes
Account numbers
Repetitive speech patterns

Latest AssemblyAI Resources and Tutorials

Real-Time Conversation Intelligence Guide

Discover how real-time conversation intelligence is transforming customer interactions from post-call analysis to live insights. Our comprehensive guide explores how streaming speech-to-text enables proactive engagement and immediate issue resolution.

Read the full guide: Real-time conversation intelligence

Hotword Detection Tutorial with Go and Streaming Speech-to-Text

Learn how to implement hotword detection using AssemblyAI's Universal-Streaming Speech-to-Text API and Go programming language. This tutorial covers:

WebSocket integration for real-time streaming
Real-time voice recognition implementation
Custom hotword trigger configuration
Practical applications for voice-activated systems

Access the tutorial: Hotword detection with AssemblyAI streaming speech-to-text

Video Tutorial: Building an AI Meeting Scheduling Assistant

Our latest YouTube tutorial demonstrates how to build an AI-powered meeting scheduling assistant using AssemblyAI's Speech-to-Text API. The step-by-step guide includes:

Voice input integration with AssemblyAI
Natural language meeting request parsing
Automated calendar availability checking
Programmatic meeting invitation sending

Watch on YouTube: Build an AI Assistant for Meeting Scheduling

Try AssemblyAI's Improved Streaming Performance

Experience the enhanced streaming accuracy firsthand in the AssemblyAI Playground. Upload your audio files or test with our examples—no coding required.

Get Started with AssemblyAI Streaming Speech-to-Text

Ready to implement these improvements in your application? Here are your next steps:

Review the Documentation: Visit our Streaming Speech-to-Text API docs
Test in the Playground: Try the AssemblyAI Playground with your own audio

‍

Streaming STT Performance Update | August 1, 2025 Newsletter

Streaming Speech-to-Text Improvements: What's New

Key Performance Improvements for Streaming STT

Repeating Digits and Tokens Enhancement

Latest AssemblyAI Resources and Tutorials

Real-Time Conversation Intelligence Guide

Hotword Detection Tutorial with Go and Streaming Speech-to-Text

Video Tutorial: Building an AI Meeting Scheduling Assistant

Try AssemblyAI's Improved Streaming Performance

Get Started with AssemblyAI Streaming Speech-to-Text

AssemblyAI's Universal-3.5 Pro Realtime is the only model in Coval's Human Parity Zone

Why real-time is the future of speech-to-text

Best real-time speech-to-text apps in 2026

How to choose the best speech-to-text API for voice agents

How to build the lowest latency voice agent in Vapi: Achieving ~465ms end-to-end Latency

Fast ASR for voice agents: bring your own turn detection

Handling transcript errors: Homophones, corrections and AI quality improvement

New - 8.37% Better Accuracy for Topic Detection and IAB Classification with V4 Update

Streaming STT Performance Update | August 1, 2025 Newsletter

Streaming Speech-to-Text Improvements: What's New

Key Performance Improvements for Streaming STT

Repeating Digits and Tokens Enhancement

Latest AssemblyAI Resources and Tutorials

Real-Time Conversation Intelligence Guide

Hotword Detection Tutorial with Go and Streaming Speech-to-Text

Video Tutorial: Building an AI Meeting Scheduling Assistant

Try AssemblyAI's Improved Streaming Performance

Get Started with AssemblyAI Streaming Speech-to-Text

Related posts

AssemblyAI's Universal-3.5 Pro Realtime is the only model in Coval's Human Parity Zone

Why real-time is the future of speech-to-text

Best real-time speech-to-text apps in 2026

How to choose the best speech-to-text API for voice agents

How to build the lowest latency voice agent in Vapi: Achieving ~465ms end-to-end Latency

Fast ASR for voice agents: bring your own turn detection

Handling transcript errors: Homophones, corrections and AI quality improvement

New - 8.37% Better Accuracy for Topic Detection and IAB Classification with V4 Update