Multichannel Transcription | AssemblyAI

Supported Languages, Regions, and Models

Multichannel transcription is supported for all languages, regions, and models.

If you have a multichannel audio file with multiple speakers, you can transcribe each of them separately.

The response includes an audio_channels property with the number of different channels, and an additional utterances property, containing a list of turn-by-turn utterances.

Each utterance contains channel information, starting at 1.

Additionally, each word in the words array contains the channel identifier.

Quickstart

1 import assemblyai as aai
2 
3 aai.settings.api_key = "<YOUR_API_KEY>"
4 
5 # audio_file = "./local_file.mp3"
6 audio_file = "https://assembly.ai/wildfires.mp3"
7 
8 config = aai.TranscriptionConfig(multichannel=True)
9 
10 transcript = aai.Transcriber(config=config).transcribe(audio_file)
11 
12 if transcript.status == "error":
13   raise RuntimeError(f"Transcription failed: {transcript.error}")
14 
15 for utterance in transcript.utterances:
16   print(f"Channel {utterance.speaker}: {utterance.text}")

Multichannel audio increases the transcription time by approximately 25%.