Extract Transcript Quotes with LLM Gateway
This guide demonstrates how to use AssemblyAI's LLM Gateway to transcribe an audio file and extract its most engaging quotes, with timestamps, via the chat completions endpoint.
Quickstart
Python
import requests
import json
import time

API_KEY = "YOUR_API_KEY"
headers = {"authorization": API_KEY}

# Transcribe the audio file
print("Submitting audio for transcription...")
transcript_response = requests.post(
    "https://api.assemblyai.com/v2/transcript",
    headers=headers,
    json={
        "audio_url": "https://assembly.ai/wildfires.mp3",
        "speaker_labels": True
    }
)

transcript_id = transcript_response.json()["id"]

# Poll for transcription completion
while True:
    transcript_result = requests.get(
        f"https://api.assemblyai.com/v2/transcript/{transcript_id}",
        headers=headers
    ).json()

    if transcript_result["status"] == "completed":
        break
    elif transcript_result["status"] == "error":
        raise Exception(f"Transcription failed: {transcript_result['error']}")

    time.sleep(3)

# Extract utterances with timestamps
utterances_data = [
    {"text": u["text"], "start": u["start"], "end": u["end"], "speaker": u["speaker"]}
    for u in transcript_result["utterances"]
]

# Create prompt with timestamped utterances
prompt = f"""You are analyzing a transcript with timestamped utterances. Each utterance includes the text content, speaker label, and start/end timestamps in milliseconds.

Here is the transcript data:
{json.dumps(utterances_data, indent=2)}

Task: Identify the 3-5 most engaging, impactful, or quotable utterances from this transcript.

Return your response as a JSON object with the following structure:
{{
  "quotes": [
    {{
      "text": "exact quote text",
      "start": start_timestamp_in_milliseconds,
      "end": end_timestamp_in_milliseconds,
      "speaker": "speaker_label",
      "reason": "brief explanation of why this quote is engaging"
    }}
  ]
}}

Return ONLY valid JSON, no additional text."""

# Use LLM Gateway to extract quotes
print("Submitting transcript to LLM Gateway for quote extraction...")
gateway_response = requests.post(
    "https://llm-gateway.assemblyai.com/v1/chat/completions",
    headers=headers,
    json={
        "model": "gpt-5-nano",
        "messages": [
            {"role": "user", "content": prompt}
        ]
    }
)

result = gateway_response.json()
quotes_json = json.loads(result["choices"][0]["message"]["content"])
print(json.dumps(quotes_json, indent=2))
Getting Started
Before you begin, make sure you have an AssemblyAI account and an API key. You can sign up for an account and find your API key in your dashboard.
Step-by-Step Instructions
Step 1: Set up your API key and headers
Python
import requests
import json
import time

API_KEY = "YOUR_API_KEY"
headers = {"authorization": API_KEY}
Step 2: Transcribe the audio file
Next, we'll submit the audio file to AssemblyAI for transcription and poll until the transcript is ready. We'll enable speaker_labels to get utterances grouped by speaker.
Python
# Transcribe the audio file
print("Submitting audio for transcription...")
transcript_response = requests.post(
    "https://api.assemblyai.com/v2/transcript",
    headers=headers,
    json={
        "audio_url": "https://assembly.ai/wildfires.mp3",
        "speaker_labels": True
    }
)

transcript_id = transcript_response.json()["id"]

# Poll for transcription completion
while True:
    transcript_result = requests.get(
        f"https://api.assemblyai.com/v2/transcript/{transcript_id}",
        headers=headers
    ).json()

    if transcript_result["status"] == "completed":
        break
    elif transcript_result["status"] == "error":
        raise Exception(f"Transcription failed: {transcript_result['error']}")

    time.sleep(3)
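The polling loop above can be factored into a reusable helper. This is a sketch of our own (not part of AssemblyAI's SDK): the status-fetching function is injected as a callable, which keeps the retry logic separate from the HTTP call and makes it easy to exercise without network access.

```python
import time

def poll_until_complete(fetch_status, interval=3.0, max_attempts=100):
    """Call fetch_status() until it reports 'completed' or 'error'.

    fetch_status must return a dict with a 'status' key, mirroring the
    /v2/transcript/{id} response shape. Raises on error or timeout.
    """
    for _ in range(max_attempts):
        result = fetch_status()
        if result["status"] == "completed":
            return result
        if result["status"] == "error":
            raise RuntimeError(f"Transcription failed: {result['error']}")
        time.sleep(interval)
    raise TimeoutError("Transcription did not complete in time")
```

With requests, you would pass something like `lambda: requests.get(f"https://api.assemblyai.com/v2/transcript/{transcript_id}", headers=headers).json()` as `fetch_status`.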
Step 3: Extract utterances with timestamps
Then we’ll take the timestamped utterances array from our transcript and format it as structured data. Utterances are grouped by speaker and include continuous speech segments.
Python
# Extract utterances with timestamps
utterances_data = [
    {"text": u["text"], "start": u["start"], "end": u["end"], "speaker": u["speaker"]}
    for u in transcript_result["utterances"]
]
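The `start` and `end` values in each utterance are milliseconds from the beginning of the audio. If you want to display quotes to users, a small conversion helper (ours, not an AssemblyAI API) turns them into readable timestamps:

```python
def ms_to_timestamp(ms):
    """Convert a millisecond offset to an H:MM:SS.mmm display string."""
    seconds, millis = divmod(int(ms), 1000)
    minutes, secs = divmod(seconds, 60)
    hours, mins = divmod(minutes, 60)
    return f"{hours}:{mins:02d}:{secs:02d}.{millis:03d}"
```

For example, an utterance starting at 62350 ms begins at 0:01:02.350 into the recording.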
Step 4: Use LLM Gateway to extract engaging quotes
Finally, we’ll provide the timestamped utterances to the LLM Gateway chat completions endpoint to extract the most engaging quotes from this transcript with their associated timestamps in a structured JSON format.
Python
# Create prompt with timestamped utterances
prompt = f"""You are analyzing a transcript with timestamped utterances. Each utterance includes the text content, speaker label, and start/end timestamps in milliseconds.

Here is the transcript data:
{json.dumps(utterances_data, indent=2)}

Task: Identify the 3-5 most engaging, impactful, or quotable utterances from this transcript.

Return your response as a JSON object with the following structure:
{{
  "quotes": [
    {{
      "text": "exact quote text",
      "start": start_timestamp_in_milliseconds,
      "end": end_timestamp_in_milliseconds,
      "speaker": "speaker_label",
      "reason": "brief explanation of why this quote is engaging"
    }}
  ]
}}

Return ONLY valid JSON, no additional text."""

# Use LLM Gateway to extract quotes
print("Submitting transcript to LLM Gateway for quote extraction...")
gateway_response = requests.post(
    "https://llm-gateway.assemblyai.com/v1/chat/completions",
    headers=headers,
    json={
        "model": "gpt-5-nano",
        "messages": [
            {"role": "user", "content": prompt}
        ]
    }
)

result = gateway_response.json()
quotes_json = json.loads(result["choices"][0]["message"]["content"])
print(json.dumps(quotes_json, indent=2))
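Even with "Return ONLY valid JSON" in the prompt, models sometimes wrap their reply in Markdown code fences, which would make a bare `json.loads` fail. A defensive parsing helper (our own addition, not part of the Gateway API) can strip an optional fence before parsing:

```python
import json
import re

def parse_llm_json(content):
    """Parse JSON from an LLM reply, tolerating an optional ``` fence."""
    match = re.search(r"```(?:json)?\s*(.*?)\s*```", content, re.DOTALL)
    if match:
        content = match.group(1)
    return json.loads(content)
```

You could then replace the `json.loads(...)` line above with `parse_llm_json(result["choices"][0]["message"]["content"])`.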
Example Response
{
  "quotes": [
    {
      "text": "It is, it is. The levels outside right now in Baltimore are considered unhealthy. And most of that is due to what's called particulate matter, which are tiny particles, microscopic, smaller than the width of your hair, that can get into your lungs and impact your respiratory system, your cardiovascular system, and even your neurological, your brain.",
      "start": 62350,
      "end": 82590,
      "speaker": "B",
      "reason": "Defines particulate matter and explains how it harms health."
    },
    {
      "text": "Yeah. So the concentration of particulate matter, I was looking at some of the monitors that we have was reaching levels of what are, in science speak, 150 micrograms per meter cubed, which is more than 10 times what the annual average should be in about four times higher than what you're supposed to have on a 24 hour average. And so the concentrations of these particles in the air are just much, much, much higher than we typically see. And exposure to those high levels can lead to a host of health problems.",
      "start": 93550,
      "end": 123350,
      "speaker": "B",
      "reason": "Gives specific concentration figures and links to health risks."
    },
    {
      "text": "It's the youngest. So children, obviously, whose bodies are still developing, the elderly who are, you know, their bodies are more in decline and they're more susceptible to the health impacts of breathing, the poor air quality. And then people who have pre existing health conditions, people with respiratory conditions or heart conditions, can be triggered by high levels of air pollution.",
      "start": 137610,
      "end": 156650,
      "speaker": "B",
      "reason": "Highlights the most vulnerable groups affected by poor air quality."
    },
    {
      "text": "Well, I think the fires are going to burn for a little bit longer. But the key for us in the US Is the weather system changing. Right now it's the weather systems that are pulling that air into our Mid Atlantic and Northeast region. As those weather systems change and shift, we'll see that smoke going elsewhere and not impact us in this region as much. I think that's going to be the defining factor. I think the next couple days we're going to see a shift in that weather pattern and start to push the smoke away from where we are.",
      "start": 198280,
      "end": 227480,
      "speaker": "B",
      "reason": "Offers an outlook on how weather patterns may reduce exposure."
    },
    {
      "text": "I mean, that is one of the predictions for climate change. Looking into the future, the fire season is starting earlier and lasting longer and we're seeing more frequent fires. So yeah, this is probably something that we'll be seeing more, more frequently. This tends to be much more of an issue in the western U.S. so the eastern U.S. getting hit right now is a little bit new. But yeah, I think with climate change moving forward, this is something that is going to happen more frequently.",
      "start": 241370,
      "end": 267570,
      "speaker": "B",
      "reason": "Connects current event to longer-term climate change trends and future frequency."
    }
  ]
}
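Because each quote carries millisecond `start`/`end` timestamps, one natural next step is cutting the quotes into standalone audio clips. As a sketch, the helper below (our own; the file names are placeholders) builds an ffmpeg command for one quote; ffmpeg expects seconds, so the timestamps are divided by 1000:

```python
def clip_command(audio_path, quote, out_path):
    """Build an ffmpeg argument list that cuts one quote into its own clip.

    quote is a dict with 'start' and 'end' in milliseconds, as returned
    by the quote-extraction step; ffmpeg's -ss/-t flags take seconds.
    """
    start_s = quote["start"] / 1000
    duration_s = (quote["end"] - quote["start"]) / 1000
    return [
        "ffmpeg", "-ss", f"{start_s:.3f}", "-t", f"{duration_s:.3f}",
        "-i", audio_path, "-c", "copy", out_path,
    ]
```

You could run each command with `subprocess.run(clip_command("wildfires.mp3", quote, f"quote_{i}.mp3"), check=True)` for each entry in `quotes_json["quotes"]`.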