Universal-3 Pro Streaming, the most accurate real-time transcription model for voice agents, is now live! Learn more

PlaygroundChangelogCommunitySign In
OverviewAPI ReferencePre-recorded STTStreaming STTVoice AgentsSpeech UnderstandingGuardrailsLLM Gateway
OverviewAPI ReferencePre-recorded STTStreaming STTVoice AgentsSpeech UnderstandingGuardrailsLLM Gateway
  • Getting started
    • Transcribe streaming audio
    • Model selection
    • Evaluations
    • Cloud endpoints & data residency
    • Webhooks
  • Models & features
    • Whisper Streaming
    • Keyterms prompting
    • Diarization and multichannel
    • Authenticate with a temporary token
    • Common session errors and closures
    • API reference
  • Migration guides
    • Migration guide Deepgram to AssemblyAI streaming
    • Migration guide Speechmatics to AssemblyAI streaming
    • Migration guide Gladia to AssemblyAI streaming
  • Guides
    • Build a meeting notetaker
    • Build a medical scribe
    • Build a voice agent
    • Apply LLM Gateway to streaming audio
    • Overview
On this page
  • Basic Streaming Workflows
  • Streaming for Front-End Applications
  • Streaming with LLM Gateway
  • Use Case Specific Streaming Workflows
Guides

Overview

AssemblyAI’s Streaming Speech-to-Text (STT) allows you to transcribe live audio streams with high accuracy and low latency. By streaming your audio data to our secure WebSocket API, you can receive transcripts back within a few hundred milliseconds.

Basic Streaming Workflows

  • Using real-time streaming

  • Transcribe System Audio in Real-Time (macOS)

  • Terminate Streaming Session After Inactivity

  • Migrate from Streaming v2 to Streaming v3 (Python)

  • Migrate from Streaming v2 to Streaming v3 (JavaScript)

Streaming for Front-End Applications

  • Next.js Example Using Streaming STT

  • Vanilla JavaScript Front-End Examples

Streaming with LLM Gateway

  • Use LLM Gateway with Streaming Speech-to-Text (STT)

  • Translate Streaming STT Transcripts with LLM Gateway

Use Case Specific Streaming Workflows

  • Apply Noise Reduction to Audio for Streaming Speech-to-Text

  • Transcribe Audio Files with Streaming Speech-to-Text

  • Evaluate Streaming transcription accuracy with WER

  • Determine Optimal Turn Detection Settings from Historical Audio Analysis

Was this page helpful?
Previous

Transcribe System Audio in Real-Time (macOS)

Next
Built with
LogoLogo
PlaygroundChangelogCommunitySign In