Hear the speaker, not the room. On streaming transcription, Voice Focus isolates the primary voice and suppresses background chatter, keyboard clicks, fan hum, and room echo before the audio reaches the transcription model.

Variants

Pick the variant based on how close the speaker is to the microphone.
| Variant | Value | When to use |
| --- | --- | --- |
| Near field | near-field | Headsets, handsets, and other close-talking microphones. |
| Far field | far-field | Conference rooms, drive-thru speakers, laptop mics, and other distant capture setups. |

Quickstart

Set the voice_focus connection parameter when you open the WebSocket. Optionally tune voice_focus_threshold, a float between 0.0 and 1.0, to control how aggressively background audio is suppressed. Higher values are more aggressive.
CONNECTION_PARAMS = {
    "sample_rate": 16000,
    "speech_model": "u3-rt-pro",
    "voice_focus": "near-field",
}
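
The parameters above can also carry the optional voice_focus_threshold. The following is a minimal sketch of one way to validate both settings and attach them when opening the connection. It assumes the connection parameters are sent as a URL query string; the endpoint URL and the build_streaming_url helper are illustrative assumptions, not part of the documented API.

```python
from urllib.parse import urlencode

# Assumed endpoint for illustration; use the streaming URL from your own setup.
STREAMING_ENDPOINT = "wss://streaming.assemblyai.com/v3/ws"

# The two variant values listed in the Variants table.
VALID_VARIANTS = {"near-field", "far-field"}


def build_streaming_url(variant, threshold=None,
                        sample_rate=16000, speech_model="u3-rt-pro"):
    """Hypothetical helper: validate Voice Focus settings and build the URL."""
    if variant not in VALID_VARIANTS:
        raise ValueError(f"voice_focus must be one of {sorted(VALID_VARIANTS)}")

    params = {
        "sample_rate": sample_rate,
        "speech_model": speech_model,
        "voice_focus": variant,
    }

    # The threshold is optional; only send it when the caller sets it.
    if threshold is not None:
        if not 0.0 <= threshold <= 1.0:
            raise ValueError("voice_focus_threshold must be between 0.0 and 1.0")
        params["voice_focus_threshold"] = threshold

    # Assumption: parameters are passed as a query string on the WebSocket URL.
    return f"{STREAMING_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    # Far-field capture with fairly aggressive suppression.
    print(build_streaming_url("far-field", threshold=0.7))
```

Validating on the client catches a misspelled variant or an out-of-range threshold before the connection is attempted. Leaving the threshold out of the request when it is unset lets the service apply its own default.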