Multilingual streaming (Beta)
Supported languages
English, Spanish, French, German, Italian, and Portuguese
Coming soon
Our multilingual feature is currently in Beta. We’re actively working on improvements to our multilingual offering including reduced latency, returning language codes, and adding more languages. Check our public roadmap to see what’s currently being worked on.
Multilingual streaming allows you to transcribe audio streams in multiple languages.
Configuration
Keyterms prompting is not supported with multilingual streaming.
To utilize multilingual streaming, you need to include "language":"multi"
as a query parameter in the WebSocket URL.
Supported languages
Multilingual currently supports English, Spanish, French, German, Italian, and Portuguese.
Understanding formatting
The multilingual model produces transcripts with punctuation and capitalization already built into the model outputs. This means you’ll receive properly formatted text without requiring any additional post-processing.
While the API still returns the turn_is_formatted
parameter to maintain interface consistency with other streaming models, the multilingual model doesn’t perform additional formatting operations. All transcripts from the multilingual model are already formatted as they’re generated.
In the future, this built-in formatting capability will be extended to our English-only streaming model as well.
Quickstart
Python
Javascript
Firstly, install the required dependencies.