Simple transparent pricing

Free

Get started at no cost

For developers looking to prototype with Speech AI

Access to Speech-to-Text and Audio Intelligence models

Speech recognition
Speaker diarization
Custom spelling and vocabulary
Profanity filtering, auto punctuation and casing

Transcribe up to 100 hours of audio

Developer docs and community support

Start building for free

Pay as you go

Custom

Personalize your plan

For teams building products at scale

Volume discounts up to 50%

Solution architect support

Higher rate limits

Compliance with EU Data Residency standards

Compare pricing and features

	Free Start for free	Pay as you go Build your plan	Custom Talk to us

Speech-to-text Build on top of the most accurate Speech-to-Text model on the market with >92.5% accuracy

Tiers

Best

Free

$0.37 / hour

Lower rates based on volume

Nano

Free

$0.12 / hour

Lower rates based on volume

Features

Speaker Diarization

Automatic Language Detection

Profanity Filtering

Custom Vocabulary

Dual Channel

Filler Word Filtering

Custom Spelling

Word Timestamps

Auto Punctuation and Casing

ITN/Formatting

Confidence Scores

Word Search

Export SRT/VTT Captions

Export Paragraphs/Sentences

Streaming Speech-to-text Transcribe live audio and video files synchronously at low latency and high quality

Tiers

Best

$0.47 / hour

Lower rates based on volume

Features

Auto Punctuation and Casing

Custom Vocabulary

End of Utterance Detection

ITN/Formatting

Speech Understanding Extract maximum value from your voice data with our Audio Intelligence, models and LLMs

LeMUR Apply LLMs to voice data and explore a variety of LLM capabilities

Claude 3.5 Sonnet

latest

$0.003 / 1K tokens (Input)

$0.015 / 1K tokens (Output)

$0.003 / 1K tokens (Input)

$0.015 / 1K tokens (Output)

Claude 3 Opus

latest

$0.015 / 1K tokens (Input)

$0.075 / 1K tokens (Output)

$0.015 / 1K tokens (Input)

$0.075 / 1K tokens (Output)

Claude 3 Haiku

latest

$0.00025 / 1K tokens (Input)

$0.00125 / 1K tokens (Output)

$0.00025 / 1K tokens (Input)

$0.00125 / 1K tokens (Output)

Claude 3 Sonnet

latest

$0.003 / 1K tokens (Input)

$0.015 / 1K tokens (Output)

$0.003 / 1K tokens (Input)

$0.015 / 1K tokens (Output)

Claude 2.1

$0.015 / 1K tokens (Input)

$0.043 / 1K tokens (Output)

Lower rates based on volume

Claude 2.0

$0.015 / 1K tokens (Input)

$0.043 / 1K tokens (Output)

Lower rates based on volume

Claude Instant

$0.002 / 1K tokens (Input)

$0.005 / 1K tokens (Output)

Lower rates based on volume

Audio Intelligence Analyze and extract insights from voice data

Entity Detection

Free

$0.08 / hour

Lower rates based on volume

Topic Detection

Free

$0.15 / hour

Lower rates based on volume

Key Phrases

Free

$0.01 / hour

Lower rates based on volume

PII Audio Redaction

Free

$0.05 / hour

Lower rates based on volume

PII Redaction

Free

$0.08 / hour

Lower rates based on volume

Sentiment Analysis

Free

$0.02 / hour

Lower rates based on volume

Content Moderation

Free

$0.15 / hour

Lower rates based on volume

Auto Chapters

Free

$0.08 / hour

Lower rates based on volume

Summarization

Free

$0.03 / hour

Lower rates based on volume

Rate Limits

Hours of audio

Up to 100 hours

Unlimited

Concurrency

5 files

Starting at 200 files

Talk to us

Security and Privacy

GDPR

PCI-DSS

SOC 2 Type 1/Type 2

EU Data Residency

Limited

Frequently asked questions

What are the differences between Speech-to-Text tiers?

Can I sign up for free?

Do you offer volume discounts?

How fast does it take for audio and video files to process?

How does billing work?

How can I talk to someone?

What languages do you support?

What is a token?

Start building with AssemblyAI

Get started in seconds

Use our API Contact sales

import assemblyai as aai
import json

transcriber = aai.Transcriber()
transcript = transcriber.transcribe(URL, config)

print(json.dumps(transcript, indent=2))