Welcome to the AssemblyAI Blog

Learn about our latest research and product updates
Fine-Tuning Transformers for NLP
Deep Learning
Tutorials
Fine-Tuning Transformers for NLP

Since being first developed and released in the Attention Is All You Need paper Transformers have completely redefined the field of Natural Language Processing. In this blog, we show you how to quickly fine-tune Transformers for numerous downstream tasks, that often perform really well out of the box.

Transcribing Zoom Recordings Using the Zoom API and AssemblyAI
Tutorials
Transcribing Zoom Recordings Using the Zoom API and AssemblyAI

In this post, we’re going to show you how to transcribe your Zoom recordings by connecting Zoom’s API with AssemblyAI’s automatic speech recognition API. In just a few lines of code, you'll see how you can accurately transcribe your Zoom recordings!

Open Sourcing Drone Deploy ECS
Engineering
Open Sourcing Drone Deploy ECS

We’re excited to announce that we’ve open sourced another project! `drone-deploy-ecs` is a Drone plugin that enables you to deploy updates to ECS. Our engineering team has recently made the decision to migrate from Docker on EC2 to AWS ECS. We knew that moving to ECS would require us to refactor our deployment processes, so we figured we’d wrap our deployment process into a single tool that fit into our CICD solution.

A New API Endpoint to Paginate Through Historical Transcripts
Changelog
A New API Endpoint to Paginate Through Historical Transcripts

Today we are excited to make available our new List Endpoint, which give developers the ability to query for, and paginate through, all of their historical transcriptions.

Redacting Sensitive Medical Information from Transcriptions
Changelog
Redacting Sensitive Medical Information from Transcriptions

AssemblyAI's Speech-to-Text API now supports automatically detecting and redacting medical information, like drug names, injuries, medical conditions, and medical procedures, from transcription text!

How to Transcribe an Audio File in C# using the AssemblyAI API
Tutorials
How to Transcribe an Audio File in C# using the AssemblyAI API

Over the course of this post, we'll be walking you through the process of uploading an audio file from your local machine to AssemblyAI, and submitting that audio file for transcription using C#.

How to transcribe an audio file with Python and AssemblyAI
Tutorials
How to Transcribe an Audio File with Python and AssemblyAI

In this tutorial, we show you how to easily transcribe an audio or video file using Python and the AssemblyAI API in just a few lines of code!

2 New Endpoints to Return Transcripts as Paragraphs and Sentences
Changelog
2 New Endpoints to Return Transcripts as Paragraphs and Sentences

We now have two new endpoints that allow you to pull a completed transcript broken into paragraphs, or should you desire to be more specific, sentences!

Open Sourcing our Drone CI/CD CloudWatch Auto Scaler
Engineering
Open Sourcing our Drone CI/CD CloudWatch Auto Scaler

At AssemblyAI, we use Drone as our primary CI/CD tool. It's dead simple to set up and operate which frees us up to build out our product. Recently we decided to figure out how to build a cost-effective, easily-scalable Drone worker fleet for our GPU instances.

Getting started with HttpClientFactory in C# and .NET 5
Engineering
Getting started with HttpClientFactory in C# and .NET 5

HttpClientFactory has been around the .NET ecosystem for a few years now. In this post we will look at 3 basic implementations of HttpClientFactory; basic, named, and typed.

Feature Announcement: Content Safety Detection
Announcements
Changelog
Content Safety Detection is now GA!

Automatically transcribe audio and video files, and surface sensitive content, such "Hate Speech" or "NSFW" content, found within the audio.

Changelog: New Speaker Diarization model released
Changelog
Improved Speaker Diarization Accuracy Released

We have released a new Diarization model. Speaker diarization is the process of partitioning an input audio stream into homogeneous segments according to the speaker identity.

Speedy, code-free speech-to-text with AssemblyAI and Postman
Tutorials
Speedy, code-free speech-to-text with AssemblyAI and Postman

This article takes the reader through the process of setting up Postman, an API testing tool, to upload an audio file, then kick off a speech-to-text transcription and then download the completed transcription.

Uploading files to AssemblyAI using Node.js and JavaScript
Tutorials
Using AssemblyAI and Node.js to Transcribe Local Audio Files

In this blog, we look at how to upload a file from the local disk to AssemblyAI ready for transcription using Node.js and JavaScript

Getting started with speech-to-text transcriptions with AssemblyAI, JavaScript and Node.js
Tutorials
Getting started with speech-to-text transcriptions with AssemblyAI, JavaScript and Node.js

Build a command-line speech-to-text app with Node.js, JavaScript and AssemblyAI

New Punctuation and casing model
Changelog
New Punctuation and Casing model released 🎉

We’ve been hard at work at AssemblyAI dramatically improving our speech-to-text features and this week sees an update to our punctuation and casing features with a brand new model.

A Comparison of End-to-End Speech Recognition Architectures in 2021
Deep Learning
A Survey on End-To-End Speech Recognition Architectures in 2021

As part of our core research and development efforts to continue pushing the state of the art of speech recognition accuracy, in this post, we explore speech recognition architectures that are gaining new popularity in both academia and industry settings.

Building an end-to-end Speech Recognition model in PyTorch
Deep Learning
Building an end-to-end Speech Recognition model in PyTorch

The complete guide on how to build an end-to-end Speech Recognition model in PyTorch. Train your own CTC Deep Speech model using this tutorial.

PII Redaction and Speech-to-Text Accuracy Improvements
Changelog
PII Redaction and Accuracy Improvements

Last month, we introduced PII Redaction Policies, as part of a big overhaul to our PII Redaction feature to make it more flexible and powerful for you to specify exactly what you want redacted from your transcript

WhatConverts Call Tracking | AssemblyAI Speech-to-Text API Case Study
Case Studies
WhatConverts | AssemblyAI Case Study

WhatConverts call tracking platform partners with AssemblyAI Speech-to-Text API to power State-of-the-Art transcription accuracy.

AssemblyAI Awarded Best Speech-to-Text API API of 2020
Announcements
AssemblyAI Wins Best Public API 2020 🏆

Nordic APIs announces AssemblyAI Speech-to-Text API wins 2020 API of the year. Offering a high accuracy speech-to-text API built for developers.

PII Redaction for Speech-to-Text
Announcements
PII Redaction Policies for Speech-to-Text

Advanced PII Redaction from AssemblyAI Speech-to-Text API with customizable redaction rules and over 15 types of PII detected and redacted.

 AssemblyAI Speech-to-Text API - The State of Speaker Diarization
Deep Learning
Speaker Diarization: Speaker Labels for Mono Channel Files

AssemblyAI Speech-to-Text API's speaker diarization (diarisation) is the process of splitting audio or video inputs automatically based on the speaker's identity. It helps you answer the question "who spoke when?".

Voice Search Partnership with Algolia
Announcements
Voice Search Partnership with Algolia

AssemblyAI Speech-to-Text API update includes the launch of Voice Search with Algolia, improved accuracy, and enhanced speed.

Conversation Intelligence with CallRail and AssemblyAI
Announcements
[Webinar] Conversation Intelligence with CallRail

CallRail and AssemblyAI Speech-to-Text API discuss how conversation intelligence transforms call data into insights that boost sales performance and conversions.

Algolia Voice Search with AssemblyAI Speech-to-Text API
Announcements
Teaming up with Algolia for effortless Voice Search

AssemblyAI and Algolia launch a plugin for their popular InstantSearch.js library. With this library, developers can add accurate Voice Search to their websites in just a few lines of code.

AssemblyAI Speech-to-Text API Improved Accuracy WER (Word Error Rate)
Announcements
Improved WER - Speech-to-Text Accuracy vs. Google, AWS (May 2020 Update)

AssemblyAI launches significant improvements to transcription accuracy, consistently out-benchmarking other providers like Google Cloud Speech-to-Text, AWS Transcribe, and Rev.ai.

AssemblyAI Speech-to-Text API | Automated SRT and VTT Video Captions
Announcements
Automated SRT and VTT Video Captions (April 2020 Update)

Automated captions for videos in SRT and VTT format using AssemblyAI Speech-to-Text API.

PII Redaction for Speech-to-Text Transcriptions
Announcements
PII Redaction for Speech-to-Text Transcriptions (March 2020 Update)

AssemblyAI Speech-to-Text API launches PII Redaction for Transcriptions on audio, video, and podcasts.

ADVANCED TRANSCRIPTON FEATURES

Unlock your media with our advanced features like PII Redaction,
Keyword Boosts, Automatic Transcript Highlights, and more