no-chatbot

How to identify languages in audio data using Python
How to identify languages in audio data using Python

Learn how to use Python to automatically identify languages in audio files.

How to perform Speaker Diarization in Python
How to perform Speaker Diarization in Python

Learn how to use Python to perform speaker diarization on audio and video files to identify "who said what when"

Speaker diarization vs speaker recognition - what's the difference?
Speaker diarization vs speaker recognition - what's the difference?

Learn the differences between speaker diarization and speaker recognition, as well as speaker verification and speaker identification in audio analysis

Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel
Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel

Build a sophisticated Discord voice bot that leverages AssemblyAI for speech transcription, OpenAI's GPT-3.5 Turbo AI model for intelligent processing, and ElevenLabs for speech synthesis.

Analyze Audio from Zoom Calls with AssemblyAI and Node.js
Analyze Audio from Zoom Calls with AssemblyAI and Node.js

Learn how to analyze audio from Zoom calls using AssemblyAI and Node.js.

Decoding Strategies: How LLMs Choose The Next Word
Decoding Strategies: How LLMs Choose The Next Word

Large Language Models are trained to guess the next word. But when generating text, the combination of their probability estimates with algorithms known as decoding strategies is what determines how they actually choose words. Learn how decoding strategies work in this article.

The Best Audio File Formats for Speech-to-Text: A Guide
The Best Audio File Formats for Speech-to-Text: A Guide

Learn about the best audio and video formats for speech-to-text applications, as well as best practices for audio post-processing techniques.

Get started using Claude 3.5 Sonnet with audio data
Get started using Claude 3.5 Sonnet with audio data

Learn how to use the Claude 3 models with audio and video data in Python.

Florence-2: How it works and how to use it
Florence-2: How it works and how to use it

Microsoft's Florence-2 is a foundational image model that can perform almost every common task in computer vision. Learn how Florence-2 works and how to use it in this guide.

How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript
How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript

Learn how to translate speech in real-time in JavaScript with AssemblyAI and DeepL.

Create Multi-Lingual Subtitles with AssemblyAI and DeepL
Create Multi-Lingual Subtitles with AssemblyAI and DeepL

Learn how to build a web app in Go that'll use AssemblyAI to transcribe an uploaded video file and generate subtitles.

Build an AI-powered video conferencing app with Next.js and Stream
Build an AI-powered video conferencing app with Next.js and Stream

Learn how to build a Next.js video conferencing app that supports video calls with live transcriptions and an LLM-powered meeting assistant.

How to Do Hotword Detection with Streaming Speech-to-Text and Go
How to Do Hotword Detection with Streaming Speech-to-Text and Go

In this tutorial, you'll learn how to respond to hotwords in voice data using Streaming Speech-to-Text in Go.

Content moderation on audio files with Python
Content moderation on audio files with Python

Modern AI models make it easy to automatically detect the presence of sensitive topics in speech data. Learn how to perform configurable content moderation with Python in this tutorial.

Filter profanity from audio files using Python
Filter profanity from audio files using Python

Learn how to filter profanity out of audio and video files with fewer than 10 lines of code in this tutorial

Introducing the AssemblyAI integration for LangChain Go
Introducing the AssemblyAI integration for LangChain Go

Learn how to use audio transcripts in LangChain Go using the AssemblyAI document loader.

Transcribe audio and video files with Python and Universal-1
Transcribe audio and video files with Python and Universal-1

Learn how to transcribe audio and video files in your Python applications with AssemblyAI's Universal-1 speech recognition model.

Transcribe an audio file with Universal-1 in Node.js
Transcribe an audio file with Universal-1 in Node.js

Learn how to transcribe an audio file in your Node.js applications with industry-leading accuracy using Universal-1.

Transcribe an audio file with Universal-1 using Go
Transcribe an audio file with Universal-1 using Go

Learn how to transcribe an audio file in your Go applications with industry-leading accuracy using Universal-1.

Automatically redact PII from audio and video with Python
Automatically redact PII from audio and video with Python

In this tutorial, we’ll learn how to automatically redact Personal Identifiable Information (PII) from audio and video files in 5 minutes using Python and AssemblyAI.

Detect scam calls using Go with LeMUR and Twilio
Detect scam calls using Go with LeMUR and Twilio

Learn how to detect scam attempts in phone calls, using LeMUR.

AI trends in 2024: Graph Neural Networks
AI trends in 2024: Graph Neural Networks

From fundamental research to productionized AI models, let’s discover how this cutting-edge technology is powering production applications and may be shaping the future of AI.

Transcribe a phone call in real-time using Python with AssemblyAI and Twilio
Transcribe a phone call in real-time using Python with AssemblyAI and Twilio

Learn how to transcribe a phone call in real-time using Python, AssemblyAI, ngrok, and Twilio

Transcribe phone calls in real-time in Go with Twilio and AssemblyAI
Transcribe phone calls in real-time in Go with Twilio and AssemblyAI

Learn how to transcribe a Twilio phone call in real-time using Go in this tutorial.

Announcing the AssemblyAI Go SDK
Announcing the AssemblyAI Go SDK

We're excited to announce the release of our AssemblyAI SDK for Go!