Voice Search Partnership with Algolia

Share on social icon.Share on social icon.Share on social icon.Share on social icon.

This month we had one of our most anticipated launches with our friends, and the leaders in online search, Algolia. In addition, we had another model update that has shown significant improvements in accuracy and turnaround time (speed) for phone calls, videos, and podcasts. Finally, check out our two of our newest features focused around security: PII Audio Redaction and Profanity Filtering.

Voice Search with Algolia

Our friends at Algolia are known around the globe for their super fast and robust search that powers popular websites like Twitch, Stripe, and Lacoste. As a leader in search, Algolia understands the importance of giving Voice Search capabilities to their customers.

Since 71% of users prefer to search through voice compared to using a keyboard, we wanted to make it easy add voice search to your website or app in seconds. It’s in that spirit that we’re happy to announce that we’ve teamed up with Algolia to launch a plugin for their popular InstantSearch.js library. With this library, developers can add accurate Voice Search to their websites in just a few lines of code.

Voice Search API.png

See the full API documentation here

Further Accuracy Improvements

With our most recent update, we are consistently out-benchmarking all other speech-to-text API's in accuracy on our asynchronous English model. Below are our average accuracy %'s (using Word Error Rate or WER) based on benchmark reports run this month:


Speed Enhancements

Speech-to-text turnaround time was a big focus for us this month to improve speed for customers looking to transcribe webinars, video interviews, and conference calls - all longer forms of audio and video. You should notice a ~22% increase in speed across all of our models due to a couple of updates we made:

  • Optimizations in our model size
  • Improvements in our beam search

PII Redaction for Audio

In addition to PII redaction on transcripts, we now send back an updated audio URL that includes the same redactions being made in the audio clip. This means that your audio file will now silence any personal identifiable information (PII) like social security numbers and credit cards from playback using the new URL we send you.

Profanity Filter

Looking to remove curse words from your transcription text? Our new profanity filter allows you to quickly remove curse words and replace them with "#" characters in the transcript so you don't have to worry about it!

100% uptime

Another month of 100% uptime across all our models, subscribe to our status page to stay up-to-date!

Subscribe to our blog!

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

You may also like

Checkout some of our recent research and product updates

How to Convert an MP3 File to Text with an API
How to Convert an MP3 File to Text with an API

how to convert an mp3 file to text with an API

Is Word Error Rate a Good Measure of Speech Recognition Systems?
Deep Learning
Is Word Error Rate a Good Measure of Speech Recognition Systems?

What is Word Error Rate? Word Error Rate is a measure of how accurate an Automatic Speech Recognition (ASR) system performs. Quite literally, it calculates how many “errors” are in the transcription text produced by an ASR system, compared to a human transcription.

The State of Python Speech Recognition in 2021
The State of Python Speech Recognition in 2021

The State of Python Speech Recognition in 2021

build with assemblyai

Accurately convert your audio and video files to text with AssemblyAI's Speech-to-Text API