Simple, transparent pricing

Free

Get started at no cost

For developers looking to prototype with Speech AI

Access to Speech-to-Text and Audio Intelligence models

  • Speech recognition
  • Speaker diarization
  • Custom spelling and vocabulary
  • Profanity filtering, auto punctuation and casing

Transcribe up to 100 hours of audio

Developer docs and community support

Pay as you go

MOST POPULAR

Starting at $0.12/hr for Speech-to-Text

For teams ready to integrate Speech AI into their products

Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR

Streaming Speech-to-Text

Concurrency starting at 200 files and 100 streams

Cancel anytime


Custom

Personalize your plan

For teams building products at scale

Volume discounts up to 50%

Solution architect support

Higher rate limits

Compliance with EU Data Residency standards

Compare pricing and features

Free

Pay as you go

Custom

Speech-to-Text  Build on top of the most accurate Speech-to-Text model on the market with >92.5% accuracy
Tiers

Tier  Free  Pay as you go  Custom
Best  Free  $0.37 / hour   Lower rates based on volume
Nano  Free  $0.12 / hour   Lower rates based on volume
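The per-hour rates above lend themselves to a quick budget estimate. The sketch below is illustrative only: the function name `transcription_cost` and the `discount` parameter are hypothetical, and the discount is a stand-in for a negotiated volume rate, which is not published on this page.

```python
from decimal import Decimal

# Pay-as-you-go list prices per audio hour, copied from the tiers above.
RATES_PER_HOUR = {"best": Decimal("0.37"), "nano": Decimal("0.12")}


def transcription_cost(hours, tier="best", discount=Decimal("0")):
    """Estimate the cost in USD of transcribing `hours` of audio.

    `discount` is a hypothetical fraction between 0 and 0.5, standing in
    for a negotiated volume discount. It is not a published rate.
    """
    if tier not in RATES_PER_HOUR:
        raise ValueError(f"unknown tier: {tier}")
    if not Decimal("0") <= discount <= Decimal("0.5"):
        raise ValueError("discount must be between 0 and 0.5")
    cost = Decimal(str(hours)) * RATES_PER_HOUR[tier] * (1 - discount)
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"))


print(transcription_cost(1000, "best"))  # 370.00
print(transcription_cost(1000, "nano"))  # 120.00
```

Using `Decimal` rather than floats keeps the cent values exact, which matters once the numbers are summed across many invoices.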

Features  

Speaker Diarization

Automatic Language Detection

Profanity Filtering

Custom Vocabulary

Dual Channel

Filler Word Filtering

Custom Spelling

Word Timestamps

Auto Punctuation and Casing

ITN/Formatting

Confidence Scores

Word Search

Export SRT/VTT Captions

Export Paragraphs/Sentences
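Several of the features listed above are switched on through fields in the transcription request. The following is a minimal sketch that only builds a request body and makes no network call. The helper `build_transcript_request` is hypothetical, and the field names (`speaker_labels`, `language_detection`, `filter_profanity`, `disfluencies`, `word_boost`, `punctuate`, `format_text`) reflect our understanding of the transcript endpoint and should be checked against the current API reference.

```python
import json


def build_transcript_request(audio_url, *, speakers=False, detect_language=False,
                             filter_profanity=False, keep_filler_words=False,
                             vocabulary=None):
    """Build a JSON-ready request body for a transcription job.

    Hypothetical helper: field names are assumed, not confirmed by this page.
    """
    # Auto punctuation/casing and text formatting are requested explicitly here.
    body = {"audio_url": audio_url, "punctuate": True, "format_text": True}
    if speakers:
        body["speaker_labels"] = True        # Speaker Diarization
    if detect_language:
        body["language_detection"] = True    # Automatic Language Detection
    if filter_profanity:
        body["filter_profanity"] = True      # Profanity Filtering
    if keep_filler_words:
        body["disfluencies"] = True          # keep "um"/"uh" instead of filtering them
    if vocabulary:
        body["word_boost"] = list(vocabulary)  # Custom Vocabulary
    return body


request_body = build_transcript_request(
    "https://example.com/meeting.mp3",  # placeholder URL
    speakers=True,
    vocabulary=["LeMUR", "AssemblyAI"],
)
print(json.dumps(request_body, indent=2))
```

Only the options that are actually requested are added to the body, so defaults on the server side are left untouched.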

Streaming Speech-to-Text  Transcribe live audio and video streams in real time with low latency and high quality
Tiers

Tier  Pay as you go  Custom
Best  $0.47 / hour   Lower rates based on volume

Features  

Auto Punctuation and Casing

Custom Vocabulary

End of Utterance Detection

ITN/Formatting

Speech Understanding  Extract maximum value from your voice data with our Audio Intelligence models and LLMs
LeMUR  Apply LLMs to voice data and explore a variety of LLM capabilities

Model             Pay as you go (Input)  Pay as you go (Output)  Custom
LeMUR Default     $0.015 / 1K tokens     $0.043 / 1K tokens      Lower rates based on volume
LeMUR Claude 2.1  $0.015 / 1K tokens     $0.043 / 1K tokens      Lower rates based on volume
LeMUR Basic       $0.002 / 1K tokens     $0.005 / 1K tokens      Lower rates based on volume
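Because LeMUR is billed per 1K tokens with separate input and output rates, the cost of a request depends on both counts. Here is a small sketch of that arithmetic using the list prices above. The function `lemur_cost` and the dictionary keys (`"default"`, `"claude-2.1"`, `"basic"`) are labels chosen for this example and are not API identifiers.

```python
from decimal import Decimal

# (input rate, output rate) in USD per 1K tokens, copied from the prices above.
# The keys are illustrative labels, not model identifiers from the API.
LEMUR_RATES = {
    "default": (Decimal("0.015"), Decimal("0.043")),
    "claude-2.1": (Decimal("0.015"), Decimal("0.043")),
    "basic": (Decimal("0.002"), Decimal("0.005")),
}


def lemur_cost(input_tokens, output_tokens, model="default"):
    """Estimate the USD cost of one LeMUR request from its token counts."""
    if model not in LEMUR_RATES:
        raise ValueError(f"unknown model label: {model}")
    input_rate, output_rate = LEMUR_RATES[model]
    cost = (Decimal(input_tokens) * input_rate
            + Decimal(output_tokens) * output_rate) / Decimal(1000)
    # Keep four decimal places, since single requests often cost under a cent.
    return cost.quantize(Decimal("0.0001"))


# A 20,000-token transcript with a 1,000-token answer:
print(lemur_cost(20_000, 1_000, "default"))  # 0.3430
print(lemur_cost(20_000, 1_000, "basic"))    # 0.0450
```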

Audio Intelligence  Analyze and extract insights from voice data

Model                Free  Pay as you go  Custom
Entity Detection     Free  $0.08 / hour   Lower rates based on volume
Topic Detection      Free  $0.15 / hour   Lower rates based on volume
Key Phrases          Free  $0.01 / hour   Lower rates based on volume
PII Audio Redaction  Free  $0.05 / hour   Lower rates based on volume
PII Redaction        Free  $0.08 / hour   Lower rates based on volume
Sentiment Analysis   Free  $0.02 / hour   Lower rates based on volume
Content Moderation   Free  $0.15 / hour   Lower rates based on volume
Auto Chapters        Free  $0.08 / hour   Lower rates based on volume
Summarization        Free  $0.03 / hour   Lower rates based on volume
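When several Audio Intelligence models are enabled together, their hourly rates add up. The sketch below totals the pay-as-you-go prices listed above for a chosen set of models. It assumes each model is billed per audio hour independently of the others and covers only the Audio Intelligence portion, not the transcription itself. The function name `add_on_cost` is hypothetical.

```python
from decimal import Decimal

# Pay-as-you-go prices in USD per audio hour, copied from the list above.
ADD_ON_RATES = {
    "Entity Detection": Decimal("0.08"),
    "Topic Detection": Decimal("0.15"),
    "Key Phrases": Decimal("0.01"),
    "PII Audio Redaction": Decimal("0.05"),
    "PII Redaction": Decimal("0.08"),
    "Sentiment Analysis": Decimal("0.02"),
    "Content Moderation": Decimal("0.15"),
    "Auto Chapters": Decimal("0.08"),
    "Summarization": Decimal("0.03"),
}


def add_on_cost(hours, models):
    """Estimate the Audio Intelligence cost for `hours` of audio.

    Assumes each selected model is billed per audio hour on its own.
    """
    unknown = [m for m in models if m not in ADD_ON_RATES]
    if unknown:
        raise ValueError(f"unknown models: {unknown}")
    per_hour = sum((ADD_ON_RATES[m] for m in models), Decimal("0"))
    return (Decimal(str(hours)) * per_hour).quantize(Decimal("0.01"))


selected = ["Sentiment Analysis", "Summarization", "Entity Detection"]
print(add_on_cost(100, selected))  # 13.00
```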

Rate Limits  

Limit           Free             Pay as you go          Custom
Hours of audio  Up to 100 hours  Unlimited              Unlimited
Concurrency     5 files          Starting at 200 files  Talk to us
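A client that submits many files at once may want to stay under its plan's concurrency limit. One common way to do that, sketched below, is an `asyncio.Semaphore` that caps the number of jobs in flight. The names `run_with_limit` and `fake_transcribe` are hypothetical, and `fake_transcribe` only sleeps in place of a real transcription call. The limit of 5 mirrors the Free plan figure above.

```python
import asyncio


async def run_with_limit(jobs, limit):
    """Run zero-argument coroutine functions with at most `limit` in flight."""
    semaphore = asyncio.Semaphore(limit)

    async def guarded(job):
        # Each job waits here until one of the `limit` slots is free.
        async with semaphore:
            return await job()

    return await asyncio.gather(*(guarded(job) for job in jobs))


async def _demo():
    active = 0
    peak = 0

    async def fake_transcribe():
        # Stand-in for a real transcription request; tracks concurrency.
        nonlocal active, peak
        active += 1
        peak = max(peak, active)
        await asyncio.sleep(0.01)
        active -= 1
        return "done"

    results = await run_with_limit([fake_transcribe] * 12, limit=5)
    return len(results), peak


count, peak = asyncio.run(_demo())
print(f"{count} jobs finished, peak concurrency {peak}")  # peak never exceeds 5
```

Raising `limit` to match a higher plan is the only change needed. The rest of the submission code stays the same.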

Security and Privacy  

GDPR

PCI-DSS

SOC 2 Type 1/Type 2

EU Data Residency

Limited


Start building with AssemblyAI

Get started in seconds

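import assemblyai as aai
import json

# Placeholder values: substitute your own API key and a reachable audio file URL.
aai.settings.api_key = "YOUR_API_KEY"
URL = "https://example.com/audio.mp3"
config = aai.TranscriptionConfig(speaker_labels=True)

transcriber = aai.Transcriber()
transcript = transcriber.transcribe(URL, config)

# The Transcript object is not JSON-serializable, so dump its raw API response.
print(json.dumps(transcript.json_response, indent=2))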