Translation
Supported languages
enen_auen_uken_usesfrdeitptnlhijazhfikoplrutrukviafsqamarhyasazeubebnbsbgcahrcsdaetglkaelguhthahawhehuisidjwknkklolalvltlbmkmgmsmlmtmimrmnnenopapsfarosrsnsdsiskslsosuswsvtltgtateuruzcyyiyoSupported models
slam-1universalSupported regions
US only
Overview
The Translation feature automatically converts your transcribed audio content from one language to another, enabling you to reach global audiences without manual translation work. You can translate transcripts into over 100 languages with a single API request.
Key capabilities:
- Translate to multiple target languages simultaneously
- Choose between formal and informal translation styles
- Translate during transcription or add translations to existing transcripts
- Get full-text translations that preserve the original meaning and context
- Get per-speaker translated utterances when using Speaker Labels
Common use cases:
- Creating multilingual subtitles for video content
- Translating customer support calls for international teams
- Localizing podcast episodes for different markets
- Making educational content accessible in multiple languages
- Generating multilingual meeting summaries
Quickstart
There are two ways to use Translation:
- Transcribe and translate in one request - Best when you’re starting a new transcription and want to automatically translate the transcript text as part of that process
- Transcribe and translate in separate requests - Best when you already have text that you would like to translate or for more complicated workflows where you want to separate the transcription and translation tasks
Method 1: Transcribe and translate in one request
This method is ideal when you’re starting fresh and want both transcription and translation in a single workflow.
Python
JavaScript
Method 2: Transcribe and translate in separate requests
This method is useful when you already have text that you would like to translate or for more complicated workflows where you want to separate the transcription and translation tasks.
Python
JavaScript
Expected output:
Output format
The Translation API returns translations in the translated_texts key of the response. This key contains an object where each property is a language code corresponding to one of your target languages, and the value is the full translated text.
Example response structure:
API reference
Request
Method 1: Transcribe and translate in one request
When creating a new transcription, include the speech_understanding parameter directly in your transcription request:
Method 2: Add translation to existing transcripts
For existing transcripts, retrieve the completed transcript and send it to the Speech Understanding API:
Response
The Translation API returns your original transcript response with an additional translated_texts key containing the translations.
| speech_understanding | object | Container for speech understanding request and response information. |
| speech_understanding.request | object | The original translation request configuration that was submitted. |
| speech_understanding.request.translation | object | The translation parameters that were used. |
| speech_understanding.response | object | The response information from the translation process. |
| speech_understanding.response.translation | object | Status information about the translation. |
| speech_understanding.response.translation.status | string | The status of the translation. Will be "success" when translation completes successfully. |
Key differences from standard transcription
All other fields from the original transcript (text, words, utterances, confidence, etc.) remain unchanged.