- Contextual prompting (
prompt) — a natural-language description of what the audio is about: the domain, the scenario, or the full details of the conversation. - Keyterms prompting (
keyterms_prompt) — an explicit list of terms you want the model to recognize accurately.
Contextual prompting
Use theprompt parameter to provide context about your audio — describe what is being transcribed, not how to transcribe it. Formatting and behavioral instructions are ignored. The model stays grounded in the audio, so irrelevant context won’t cause hallucinated words.
For example, this is a 2-second clip from a League of Legends pro interview:
Without prompt:
- Python
- Python SDK
- JavaScript
- JavaScript SDK
Prompting guide
Contextual prompts work at three levels of specificity. Use the least specific level that covers your use case, and add detail when your audio contains uncommon names or terms the model can’t otherwise know.| Level | Length | What it contains | Example |
|---|---|---|---|
| Domain | 2–5 words | The domain only | Medical consultation call. |
| Scenario | 5–15 words | What the conversation is about | Cardiology consultation about chest pain symptoms. |
| Detailed | 20–50 words | Full description, including names, products, or identifiers | Cardiology consultation between Dr. Smith and an elderly patient regarding recurring chest pain, ECG results, and medication adjustment for hypertension. |
- Write plain, complete sentences that describe the audio
- Keep it to one short block of text. Don’t pack lists of keywords into the contextual prompt
Keyterms prompting
Keyterms prompting allows you to provide up to 1,000 words or phrases (maximum 6 words per phrase) using thekeyterms_prompt parameter to improve transcription accuracy for those terms and related variations or contextually similar phrases.
Here is an example showing how you can use keyterms prompting to improve transcription accuracy for a name with distinctive spelling and formatting.
Without keyterms prompting:
- Python
- JavaScript
- Python SDK
- JavaScript SDK
Keyword count limitsWhile we support up to 1000 key words and phrases, actual capacity may be lower due to internal tokenization and implementation constraints.
Key points to remember:
- Each word in a multi-word phrase counts towards the 1000 keyword limit
- Capitalization affects capacity (uppercase tokens consume more than lowercase)
- Longer words consume more capacity than shorter words