Can I use speaker diarization with Streaming Speech-to-Text? | AssemblyAI

Streaming diarization can be accomplished by using multichannel audio (i.e., an audio stream for each speaker). A separate session is created for each speaker and a single transcript is created from those sessions. Check out this section of our documentation to learn more about this approach.

Learn More About Streaming

For more information about our Streaming API and how to implement it in your projects, please refer to our detailed documentation:

Streaming API Documentation

Future Updates

We continuously work on improving our services. For the latest updates on our features, including any developments in real-time speaker diarization, please check our Changelog.