Tutorials
48 posts
Guided tutorials on NLP, Machine Learning, Deep Learning, coding, and related topics.
AssemblyAI and Python in 5 Minutes
Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.

How to Build a JavaScript Audio Transcript Application
Learn how to build a JavaScript Audio Transcript application using Node.js and Axios with this step-by-step beginner's guide.

MediaPipe for Dummies
With just a few lines of code, MediaPipe allows you to incorporate State-of-the-Art Machine Learning capabilities into your applications. Learn about MediaPipe and how to use its simple APIs in this beginner's guide.
Deep Learning
38 posts
Deep dives into NLP, Machine Learning, Deep Learning, AI, coding, and other topics.
Introduction to Diffusion Models for Machine Learning
The meteoric rise of Diffusion Models is one of the biggest developments in Machine Learning in the past several years. Learn everything you need to know about Diffusion Models in this easy-to-follow guide.

How DALL-E 2 Actually Works
How does OpenAI's groundbreaking DALL-E 2 model actually work? Check out this detailed guide to learn the ins and outs of DALL-E 2.

Review - ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
This week’s Deep Learning Paper Review is ALBERT: A Lite BERT for Self-supervised Learning of Language Representations.

AssemblyAI Recognized as G2 High Performer, Momentum Leader in Voice Recognition Software for Spring 2022
We are excited to announce that AssemblyAI has once again been recognized as a High Performer and Momentum Leader in Voice Recognition for Spring 2022 on G2.com.

Announcing Our $28M Series A Led by Accel
We’re excited to share that we’ve raised a $28M Series A led by Accel, with participation from a great group of investors including Y Combinator, John and Patrick Collison, Nat Friedman, and Daniel Gross.

DeltaHacks - AssemblyAI at McMaster University Hackathon
This past weekend, AssemblyAI was a proud sponsor of DeltaHacks, an annual student-run hackathon at McMaster University.
Videos
30 posts
Guided tutorials and explainer videos on Machine Learning, Deep Learning, and Artificial Intelligence topics.
Learn How To Get Started with OpenAI API and GPT-3
Learn how to get started with the OpenAI API and GPT-3 in Python.

What is Gradient Clipping for Neural Networks?
In this video, we will learn about Gradient Clipping, a technique to tackle the exploding gradients problem in Neural Networks.

Hyperparameters of Neural Networks
In this video, we take a high-level look at all the main hyperparameters of Neural Networks.
Industry
25 posts
Speech recognition, Speech-to-Text transcription, Audio Intelligence, and other industry-related topics.
Text Summarization for NLP: 5 Best APIs in 2022
In this article, we’ll discuss what exactly text summarization is, how it works, a few of the best Text Summarization APIs, and some of its use cases.

Building an Intelligent Cloud-based Contact Center? How ASR, NLP, and NLU Tools Can Help
Fifty-six percent of contact centers plan to invest in AI technology. But choosing the right AI tech is key to success today and for safeguarding that success into the future. Whether it’s to optimize workflows that increase agent productivity, automate QA management, or support more efficient agent training, many

How to Optimize Video Editing Platforms with ASR, NLP, and NLU Tools
Learn how tools backed by Artificial Intelligence, Deep Learning, and Machine Learning, such as Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Natural Language Understanding (NLU), help create industry-best video editing platforms.
Case Studies
9 posts
Innovative companies and developer projects using AssemblyAI’s Speech-to-Text Core Transcription and Audio Intelligence APIs.
Built with AssemblyAI - Real-time Speech-to-Image Generation
In our Built with AssemblyAI series, we showcase projects built with the AssemblyAI Core Transcription and Audio Intelligence APIs. This Real-time Speech-to-Image Generation project was built by students at ASU HACKML 2022.

Built with AssemblyAI - YouTube Transcripts
In our Built with AssemblyAI series, we're featuring YouTube Transcripts, a platform that generates transcripts for YouTube videos with one click.

Built with AssemblyAI - Wordcab
In our Built with AssemblyAI series, we showcase developer projects, innovative companies, and impressive products created using the AssemblyAI Speech-to-Text transcription API and/or our Audio Intelligence API. Our latest post features Wordcab.

8/31/2021 AWS Outage Post-Mortem
On Tuesday, August 31st, AWS had an outage in their us-west-2 region. At 18:00 UTC that day, we experienced an increase in 5xx error codes returned by our API, as well as a slowdown in transcription turnaround time.

Open Sourcing Drone Deploy ECS
We’re excited to announce that we’ve open sourced another project! drone-deploy-ecs is a Drone plugin that enables you to deploy updates to ECS.

Open Sourcing our Drone CI/CD CloudWatch Auto Scaler
At AssemblyAI, we use Drone as our primary CI/CD tool. It's dead simple to set up and operate, which frees us up to build out our product. Recently we decided to figure out how to build a cost-effective, easily scalable Drone worker fleet for our GPU instances.