Tutorials

Guided tutorials on NLP, Machine Learning, Deep Learning, coding, and related topics.

Build Your Own Imagen Text-to-Image Model
Build Your Own Imagen Text-to-Image Model

Text-to-Image models have made great strides this year, from DALL-E 2 to the more recent Imagen model. In this tutorial learn how to build a minimal Imagen implementation - MinImagen.

Getting Started with ESPnet
Getting Started with ESPnet

ESPnet is the premier end-to-end, open-source speech processing toolkit. This easy-to-follow guide will help you get started using ESPnet for Speech Recognition.

AssemblyAI and Python in 5 Minutes
AssemblyAI and Python in 5 Minutes

Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.

How to Build a JavaScript Audio Transcript Application
How to Build a JavaScript Audio Transcript Application

Learn how to build a JavaScript Audio Transcript application using Node.js and Axios with this step-by-step beginner's guide.

MediaPipe for Dummies
MediaPipe for Dummies

With just a few lines of code, MediaPipe allows you to incorporate State-of-the-Art Machine Learning capabilities into your applications. Learn about MediaPipe and how to use its simple APIs in this beginner's guide.

JavaScript Text-to-Speech - The Easy Way
JavaScript Text-to-Speech - The Easy Way

Learn how to build a simple JavaScript Text-to-Speech application using JavaScript's Web Speech API in this step-by-step beginner's guide.

React Text to Speech - Simplified!
React Text to Speech - Simplified!

Learn how to create a simple React Text-to-Speech application with this step-by-step beginner's guide.

A Beginner's Guide to TorchStudio, The PyTorch IDE
A Beginner's Guide to TorchStudio, The PyTorch IDE

Learn how to build, train, and compare models with TorchStudio - the IDE built specifically for PyTorch.

Automate Meeting Notes with Python
Automate Meeting Notes with Python

Let's build a web app that receives the audio recording of a meeting and automatically generates meeting notes using AssemblyAI's Speech-to-Text API.

React Speech Recognition with React Hooks
React Speech Recognition with React Hooks

Learn how to build a React Speech Recognition app that transcribes your voice using the AssemblyAI API.

Transcribe Audio Files in an S3 Bucket with AssemblyAI
Transcribe Audio Files in an S3 Bucket with AssemblyAI

Learn how to transcribe audio files stored in an AWS S3 bucket with AssemblyAI in 3 simple steps.

Kaldi Install for Dummies
Kaldi Install for Dummies

A step-by-step Kaldi install tutorial so you can get up and running on your NLP projects as soon as possible.

Auto-Tweet Your Words Using Speech Recognition in Python
Auto-Tweet Your Words Using Speech Recognition in Python

We say the funniest things when no one is listening. But what if someone did, all the time? In this article, we will learn how to make an app that will listen to you all the time and Tweet the funniest, smartest or most relatable things you say out loud.

How To Convert Voice To Text Using JavaScript
How To Convert Voice To Text Using JavaScript

This article shows how Real-Time Speech Recognition from a microphone recording can be integrated into your JavaScript application in only a few lines of code.

Differentiable Programming - A Simple Introduction
Differentiable Programming - A Simple Introduction

What is Differentiable Programming, and how is it different from Deep Learning? Check out this introduction to learn everything you need to know!

How to do Speech-To-Text with Golang
How to do Speech-To-Text with Golang

This article shows how Speech Recognition can be integrated into your Golang application in only a few lines of code.

How to Build a Python Project that Summarizes Your Lectures
How to Build a Python Project that Summarizes Your Lectures

Learn how to build a Python app that lets you study faster by automatically summarizing lectures!

Transcribe Twilio Phone Calls in Real-Time with AssemblyAI
Transcribe Twilio Phone Calls in Real-Time with AssemblyAI

Learn how to use AssemblyAI's Speech-to-Text API to get accurate transcriptions during a Twilio call

Kaldi Speech Recognition for Beginners - A Simple Tutorial
Kaldi Speech Recognition for Beginners - A Simple Tutorial

Want to learn how to use Kaldi for Speech Recognition? Check out this simple tutorial to start transcribing audio in minutes.

Backpropagation For Neural Networks Explained
Backpropagation For Neural Networks Explained

In this Deep Learning tutorial, we learn about the Backpropagation algorithm for neural networks.

Jupyter Notebooks Tips and Tricks
Jupyter Notebooks Tips and Tricks

In this video, we learn how to use Jupyter notebooks, how to perform basic actions, what each indicator means, and some extra tips and tricks.

Introduction to Variational Autoencoders Using Keras
Introduction to Variational Autoencoders Using Keras

The complete guide to understanding and implementing Variational Autoencoders with Keras

What is GPT-3 and How Does It Work?
What is GPT-3 and How Does It Work?

In this video, we will learn why GPT-3 is so unique and how it manages to help bring in a new wave of excitement for AI.

Getting Started With Torchaudio
Getting Started With Torchaudio

In this PyTorch tutorial, we learn how to get started with Torchaudio and work with audio data.

Auto Chapters in Action - Build a Web App that Automatically Summarizes Podcasts
Auto Chapters in Action - Build a Web App that Automatically Summarizes Podcasts

In this video, we learn to build a streamlit app to summarize podcast episodes with the AssemblyAI Speech-to-Text API.