Supported Languages & Features
AssemblyAI supports a wide range of languages across our speech-to-text models for pre-recorded audio. The available languages and features vary by model. Check out the Models page to learn more about our different models and how to choose the best one for your use case. See our Model selection page for more details on specifying a model in your request.
Universal-3 Pro
Regional dialects and variants
Universal-3 Pro goes beyond standard language support with deep understanding of regional dialects and local variants. Whether your audio features Quebecois French, Mexican Spanish, or Brazilian Portuguese, the model accurately captures speech as it’s naturally spoken — including colloquial expressions, local vocabulary, and accent-specific pronunciation patterns.
Dialect support
You do not need to specify a dialect code to get accurate dialect transcription. Universal-3 Pro automatically recognizes regional speech patterns when using the base language code (e.g., fr for all French dialects, es for all Spanish dialects).
English dialects and variants
Spanish dialects and variants
French dialects and variants
Portuguese dialects and variants
Italian dialects and variants
Universal-2
Breakdown of Universal-2 language support
High accuracy (≤ 10% WER)
English, Spanish, French, German, Indonesian, Italian, Japanese, Dutch, Polish, Portuguese, Russian, Turkish, Ukrainian, Catalan
Good accuracy (>10% to ≤25% WER)
Arabic, Azerbaijani, Bulgarian, Bosnian, Mandarin Chinese, Czech, Danish, Greek, Estonian, Finnish, Galician, Hindi, Croatian, Hungarian, Korean, Macedonian, Malay, Norwegian, Romanian, Slovak, Swedish, Swiss German, Tagalog, Thai, Urdu, Vietnamese
Moderate accuracy (>25% to ≤50% WER)
Afrikaans, Belarusian, Welsh, Persian (Farsi), Hebrew, Armenian, Icelandic, Kazakh, Lithuanian, Latvian, Maori, Marathi, Slovenian, Swahili, Tamil
Fair accuracy (>50% WER)
Amharic, Assamese, Bengali, Gujarati, Hausa, Javanese, Georgian, Khmer, Kannada, Luxembourgish, Lingala, Lao, Malayalam, Mongolian, Maltese, Burmese, Nepali, Occitan, Punjabi, Pashto, Sindhi, Shona, Somali, Serbian, Telugu, Tajik, Uzbek, Yoruba