Speech Understanding

Extract structured insights from audio with Speech Understanding models.

Speech Understanding models analyze your transcripts to extract information such as speaker identities, sentiment, topics, and summaries.

Speaker Identification

Identify and label speakers in your audio to attribute speech to the correct person.

Translation

Translate transcripts into other languages.

Custom Formatting

Customize how your transcript text is formatted.

Entity Detection

Detect and classify entities like names, locations, and organizations in your transcripts.

Sentiment Analysis

Detect the sentiment of each sentence in your transcript.

Auto Chapters

Automatically segment your transcript into chapters with summaries.

Key Phrases

Extract the most important phrases and words from your transcript.

Topic Detection

Detect topics discussed in your audio using the IAB taxonomy.

Summarization

Generate summaries of your transcripts in different formats.
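
As a rough illustration of how several of the features above might be combined, the sketch below builds a JSON request body with one boolean flag per feature. The flag names (`entity_detection`, `sentiment_analysis`, `auto_chapters`, `auto_highlights`, `iab_categories`, `summarization`), the `audio_url` field, and the helper `build_transcript_request` are assumptions made for this example and are not stated on this page; confirm the exact parameters in the API Reference before relying on them. Speaker Identification, Translation, and Custom Formatting are left out because their request shape is not described here.

```python
import json

# Assumed mapping from the feature names on this page to request flags.
# These flag names are illustrative assumptions, not confirmed by this page.
FEATURE_FLAGS = {
    "Entity Detection": "entity_detection",
    "Sentiment Analysis": "sentiment_analysis",
    "Auto Chapters": "auto_chapters",
    "Key Phrases": "auto_highlights",
    "Topic Detection": "iab_categories",
    "Summarization": "summarization",
}


def build_transcript_request(audio_url, features):
    """Return a request body dict enabling the named features (hypothetical helper)."""
    payload = {"audio_url": audio_url}
    for name in features:
        if name not in FEATURE_FLAGS:
            raise ValueError(f"Unknown feature: {name}")
        payload[FEATURE_FLAGS[name]] = True
    return payload


if __name__ == "__main__":
    request_body = build_transcript_request(
        "https://example.com/audio.mp3",  # placeholder URL
        ["Sentiment Analysis", "Entity Detection"],
    )
    # The body would be sent with your API key to the transcription endpoint.
    print(json.dumps(request_body, indent=2))
```

The helper only assembles the request body and makes no network call, so it can be checked locally before wiring it into a real client.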
