Multichannel Transcription
Supported Languages, Regions, and Models
Multichannel transcription is supported for all languages, regions, and models.
If your audio file has multiple channels, each carrying a different speaker, you can transcribe each channel separately.
The response includes an audio_channels property, which gives the number of channels in the file, and an utterances property, which contains a list of turn-by-turn utterances.
Each utterance identifies the channel it came from, with channel numbering starting at 1.
Additionally, each word in the words array contains the channel identifier.
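As a rough illustration of how these properties might be consumed, the sketch below groups utterance text by channel. It works on a hand-written dictionary rather than a live API response: the audio_channels, utterances, and words names come from the description above, while the text and channel keys, the channel value type, and the helper name utterances_by_channel are assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical response fragment, written by hand for illustration only.
# The "text" and "channel" keys are assumed; check a real response
# before relying on their exact names or types.
sample_response = {
    "audio_channels": 2,
    "utterances": [
        {"channel": 1, "text": "Thanks for calling.",
         "words": [{"text": "Thanks", "channel": 1},
                   {"text": "for", "channel": 1},
                   {"text": "calling.", "channel": 1}]},
        {"channel": 2, "text": "Hi there.",
         "words": [{"text": "Hi", "channel": 2},
                   {"text": "there.", "channel": 2}]},
        {"channel": 1, "text": "How can I help?",
         "words": [{"text": "How", "channel": 1},
                   {"text": "can", "channel": 1},
                   {"text": "I", "channel": 1},
                   {"text": "help?", "channel": 1}]},
    ],
}


def utterances_by_channel(response):
    """Group utterance text by channel, preserving turn order."""
    grouped = defaultdict(list)
    for utterance in response["utterances"]:
        # Normalize to str so the grouping works whether the channel
        # identifier arrives as a number or as a string.
        grouped[str(utterance["channel"])].append(utterance["text"])
    return dict(grouped)


for channel, texts in utterances_by_channel(sample_response).items():
    print(f"Channel {channel}: {' '.join(texts)}")
```

Grouping by the utterance-level channel keeps each speaker's turns together; the same loop could read the channel from each entry in the words array if word-level detail is needed.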
Quickstart
Multichannel audio increases the transcription time by approximately 25%.
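To make the overhead concrete, here is a minimal sketch that applies the approximate 25% figure to a known single-channel transcription time. The function name and the 60-second input are hypothetical, and the result is an estimate only, since the figure itself is approximate.

```python
# Approximate overhead quoted above; actual times will vary.
MULTICHANNEL_OVERHEAD = 0.25


def estimate_multichannel_seconds(single_channel_seconds: float) -> float:
    """Estimate multichannel transcription time from a single-channel baseline."""
    return single_channel_seconds * (1 + MULTICHANNEL_OVERHEAD)


# A job that takes 60 seconds without multichannel would take about 75 seconds with it.
print(estimate_multichannel_seconds(60.0))
```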