Easily transcribe your audio or video files across multiple APIs, like AssemblyAI, Google, and AWS, and compare the transcripts and accuracy, all for free.GET STARTED NOW
Shopping for speech-to-text can be tricky. Every provider claims they have the "best" accuracy. Instead of promising, we'd like to show you first hand how well we perform. We provide all raw transcripts in case you'd like to run any additional internal benchmarking.
Word Error Rate (WER) is the industry standard for calculating the accuracy of an automatic speech recognition system. The WER compares the predicted transcription to a human transcription for an audio file, and counts the number of insertions, deletions, and substitutions made by the automatic speech recognition system in order to derive the WER. TLDR - we compare API transcripts with human transcripts to calculate Word Error Rate (WER).
Get a quick reference of how our speech recognition results compare to other APIs like Google and AWS Transcribe
In this report, we look at 5 different earning calls from various companies, and review how accurately AssemblyAI, AWS Transcribe, and Google Speech-to-Text are able to automatically transcribe these recordings.
We scoured the internet for a wide variety of media content—from news broadcasts on current events, to user generated social videos, and public podcasts - and compared AssemblyAI's transcription accuracy to Google Speech-to-Text and Amazon Transcribe.