Auto Chapters | AssemblyAI

Supported languages

Global Englishen

Australian Englishen_au

British Englishen_uk

US Englishen_us

The Auto Chapters model summarizes audio data over time into chapters. Chapters makes it easy for users to navigate and find specific information.

Each chapter contains the following:

Summary
One-line gist
Headline
Start and end timestamps

Auto Chapters and Summarization

You can only enable one of the Auto Chapters and Summarization models in the same transcription.

Quickstart

Python SDK

Python

JavaScript SDK

JavaScript

C#

Ruby

PHP

Enable Auto Chapters by setting auto_chapters to True in the transcription config. punctuate must be enabled to use Auto Chapters (punctuate is enabled by default).

1 import assemblyai as aai
2 
3 aai.settings.api_key = "<YOUR_API_KEY>"
4 
5 # audio_file = "./local_file.mp3"
6 audio_file = "https://assembly.ai/wildfires.mp3"
7 
8 config = aai.TranscriptionConfig(auto_chapters=True)
9 
10 transcript = aai.Transcriber().transcribe(audio_file, config)
11 print(f"Transcript ID:", transcript.id)
12 
13 for chapter in transcript.chapters:
14   print(f"{chapter.start}-{chapter.end}: {chapter.headline}")

Example output

1 250-28840: Smoke from hundreds of wildfires in Canada is triggering air quality alerts across US
2 29610-280340: High particulate matter in wildfire smoke can lead to serious health problems

Auto Chapters Using LeMUR

Check out this cookbook Creating Chapter Summaries for an example of how to leverage LeMUR’s custom text input parameter for chapter summaries.

API reference

Request

$ curl https://api.assemblyai.com/v2/transcript \
> --header "Authorization: <YOUR_API_KEY>" \
> --header "Content-Type: application/json" \
> --data '{
>   "audio_url": "YOUR_AUDIO_URL",
>   "auto_chapters": true
> }'

Key	Type	Description
`auto_chapters`	boolean	Enable Auto Chapters.

Response

Key	Type	Description
`chapters`	array	An array of temporally sequential chapters for the audio file.
`chapters[i].gist`	string	An short summary in a few words of the content spoken in the i-th chapter.
`chapters[i].headline`	string	A single sentence summary of the content spoken during the i-th chapter.
`chapters[i].summary`	string	A one paragraph summary of the content spoken during the i-th chapter.
`chapters[i].start`	number	The starting time, in milliseconds, for the i-th chapter.
`chapters[i].end`	number	The ending time, in milliseconds, for the i-th chapter.

The response also includes the request parameters used to generate the transcript.

Frequently asked questions

Can I specify the number of chapters to be generated by the Auto Chapters model?

No, the number of chapters generated by the Auto Chapters model isn’t configurable by the user. The model automatically segments the audio file into logical chapters as the topic of conversation changes.

Troubleshooting

Why am I not getting any chapter predictions for my audio file?

One possible reason is that the audio file doesn’t contain enough variety in topic or tone for the model to identify separate chapters. Another reason could be due to background noise or low-quality audio interfering with the model’s analysis.