Skip to main content

Transcript

POST/v2/transcriptCreate a transcript using the parameters described below.
GET/v2/transcript/:idFetch the transcript object you created, using its id.
GET/v2/transcript/:id/sentencesOptionally, after your transcript is complete, get the sentences using its id.
GET/v2/transcript/:id/paragraphsOptionally, after your transcript is complete, get the paragraphs using its id.
GET/v2/transcript/:id/redacted-audioIf you enable PII redaction, get the redacted audio, as described in the Create a redacted audio file section.
GET/v2/transcript/:id/:subtitle_formatExport your completed transcripts in SRT (srt) or VTT (vtt) format, which can be used for subtitles and closed captions in videos.

The Transcript Object

idstringThe unique identifier of your transcription
statusstringThe status of your transcription. queued, processing, completed, or error
language_codestringThe language of your audio file. The default value is en_us.
audio_urlstringThe URL of your media file to transcribe
textstringThe text transcription of your media file
wordsarrayA list of all the individual words transcribed
utterancesarrayWhen dual_channel or speaker_labels is enabled, a list of turn-by-turn utterances
confidencefloatThe confidence our model has in the transcribed text, between 0.0 and 1.0
audio_durationfloatThe duration of your media file, in seconds
punctuatebooleanEnable Automatic Punctuation, can be true or false
format_textbooleanEnable Text Formatting, can be true or false
dual_channelbooleanEnable Dual Channel transcription, can be true or false
webhook_urlstringThe URL we should send webhooks to when your transcript is complete
webhook_auth_header_namestringDefaults to null. Optionally allows a user to specify a header name and value to send back with a webhook call for added security.
webhook_auth_header_valuestringDefaults to null. Optionally allows a user to specify a header name and value to send back with a webhook call for added security.
webhook_status_codestringThe status code we received from your server when delivering your webhook
auto_highlights_resultarrayThe list of results when enabling Automatic Transcript Highlights
audio_start_fromintegerThe point in time, in milliseconds, to begin transcription from in your media file
audio_end_atintegerThe point in time, in milliseconds, to stop transcribing in your media file
word_boostarrayA list of custom vocabulary to boost accuracy for
boost_paramstringThe weight to apply to words/phrases in the word_boost array; can be "low", "default", or "high"
filter_profanitybooleanFilter profanity from the transcribed text, can be true or false
redact_piibooleanRedact PII from the transcribed text, can be true or false
redact_pii_audiobooleanGenerate a copy of the original media file with spoken PII "beeped" out, can be true or false
redact_pii_policiesarrayThe list of PII Redaction policies to enable
redact_pii_substringThe replacement logic for detected PII, can be "entity_type" or "hash"
speaker_labelsbooleanEnable Speaker Diarization, can be true or false
speakers_expectedintegerDefaults to null. Tells the speaker label model how many speakers it should attempt to identify, up to 10.
content_safetybooleanEnable Content Moderation, can be true or false
iab_categoriesbooleanEnable Topic Detection, can be true or false
content_safety_labelsarrayThe list of results when content_safety is true
iab_categories_resultarrayThe list of results when iab_categories is true
custom_spellingarrayCustomize how words are spelled and formatted using to and from values
disfluenciesbooleanTranscribe Filler Words, like "umm", in your media file; can be true or false
sentiment_analysisbooleanEnable Sentiment Analysis, can be true or false
auto_chaptersbooleanEnable Auto Chapters, can be true or false
chaptersarrayWhen Auto Chapters is enabled, the list of Auto Chapters results
sentiment_analysis_resultsarrayWhen Sentiment Analysis is enabled, the list of Sentiment Analysis results
entity_detectionbooleanEnable Entity Detection, can be true or false
entitiesarrayWhen Entity Detection is enabled, the list of detected Entities
throttledbooleanTrue while a request is throttled and false when a request is no longer throttled

Create a transcript

audio_urlstringThe URL of your media file to transcribeYes
language_codestringThe language of your audio file. Possible values are found in Supported Languages. The default value is en_us.
punctuatebooleanEnable Automatic Punctuation, can be true or false
format_textbooleanEnable Text Formatting, can be true or false
dual_channelbooleanEnable Dual Channel transcription, can be true or false
webhook_urlstringThe URL we should send webhooks to when your transcript is complete
webhook_auth_header_namestringDefaults to null. Optionally allows a user to specify a header name and value to send back with a webhook call for added security.
webhook_auth_header_valuestringDefaults to null. Optionally allows a user to specify a header name and value to send back with a webhook call for added security.
audio_start_fromintegerThe point in time, in milliseconds, to begin transcription from in your media file
audio_end_atintegerThe point in time, in milliseconds, to stop transcribing in your media file
word_boostarrayA list of custom vocabulary to boost accuracy for
boost_paramstringThe weight to apply to words/phrases in the word_boost array; can be "low", "default", or "high"
filter_profanitybooleanFilter profanity from the transcribed text, can be true or false
redact_piibooleanRedact PII from the transcribed text, can be true or false
redact_pii_audiobooleanGenerate a copy of the original media file with spoken PII "beeped" out, can be true or false
redact_pii_audio_qualitystringControls the filetype of the audio created by redact_pii_audio. Currently supports mp3 (default) and wav.
redact_pii_policiesarrayThe list of PII Redaction policies to enable
redact_pii_substringThe replacement logic for detected PII, can be "entity_type" or "hash"
speaker_labelsbooleanEnable Speaker Diarization, can be true or false
speakers_expectedintegerDefaults to null. Tells the speaker label model how many speakers it should attempt to identify, up to 10.
content_safetybooleanEnable Content Moderation, can be true or false
iab_categoriesbooleanEnable Topic Detection, can be true or false
custom_spellingarrayCustomize how words are spelled and formatted using to and from values
disfluenciesbooleanTranscribe Filler Words, like "umm", in your media file; can be true or false
sentiment_analysisbooleanEnable Sentiment Analysis, can be true or false
auto_chaptersbooleanEnable Auto Chapters, can be true or false
entity_detectionbooleanEnable Entity Detection, can be true or false