Ryan O'Connor - News, Tutorials, AI Research

How to perform Speaker Diarization in Python

Tutorials

Sep 10, 2024

How to perform Speaker Diarization in Python

Learn how to use Python to perform speaker diarization on audio and video files to identify "who said what when"

Ryan O'Connor

Developer Educator

Speaker diarization vs speaker recognition - what's the difference?

Industry

Sep 9, 2024

Speaker diarization vs speaker recognition - what's the difference?

Learn the differences between speaker diarization and speaker recognition, as well as speaker verification and speaker identification in audio analysis

Ryan O'Connor

Developer Educator

Florence-2: How it works and how to use it

Deep Learning

Jul 15, 2024

Florence-2: How it works and how to use it

Microsoft's Florence-2 is a foundational image model that can perform almost every common task in computer vision. Learn how Florence-2 works and how to use it in this guide.

Ryan O'Connor

Developer Educator

Speaker diarization improvements: new languages, increased accuracy

Announcements

Jun 20, 2024

Speaker diarization improvements: new languages, increased accuracy

Announcing several improvements to our Speaker Diarization service, yielding a more accurate model that's available in more languages.

Ryan O'Connor

Developer Educator

Content moderation on audio files with Python

Tutorials

May 27, 2024

Content moderation on audio files with Python

Modern AI models make it easy to automatically detect the presence of sensitive topics in speech data. Learn how to perform configurable content moderation with Python in this tutorial.

Ryan O'Connor

Developer Educator

Filter profanity from audio files using Python

Tutorials

May 22, 2024

Filter profanity from audio files using Python

Learn how to filter profanity out of audio and video files with fewer than 10 lines of code in this tutorial

Ryan O'Connor

Developer Educator

Automatically redact PII from audio and video with Python

Tutorials

Mar 18, 2024

Automatically redact PII from audio and video with Python

In this tutorial, we’ll learn how to automatically redact Personal Identifiable Information (PII) from audio and video files in 5 minutes using Python and AssemblyAI.

Ryan O'Connor

Developer Educator

Transcribe a phone call in real-time using Python with AssemblyAI and Twilio

Tutorials

Feb 15, 2024

Transcribe a phone call in real-time using Python with AssemblyAI and Twilio

Learn how to transcribe a phone call in real-time using Python, AssemblyAI, ngrok, and Twilio

Ryan O'Connor

Developer Educator

Lower latency, lower cost, more possibilities

Announcements

Jan 10, 2024

Lower latency, lower cost, more possibilities

We’re excited to introduce major improvements to our API’s inference latency, with the majority of audio files now completing in well under 45 seconds regardless of audio duration.

Ryan O'Connor

Developer Educator

Extract phone call insights with LLMs in Python

Tutorials

Nov 30, 2023

Extract phone call insights with LLMs in Python

Learn how to automatically extract insights from customer calls with Large Language Models (LLMs) and Python.

Ryan O'Connor

Developer Educator

Automatically determine video sections with AI using Python

Tutorials

Nov 7, 2023

Automatically determine video sections with AI using Python

In this tutorial, we will learn how to automatically determine video sections, how to generate section titles with LLMs, and how to format the information for YouTube chapters.

Ryan O'Connor

Developer Educator

Tutorials

Oct 6, 2023

Real-time transcription in Python

Learn how to perform real-time transcription on audio streams using Python in this tutorial.

Ryan O'Connor

Developer Educator

Deep Learning

Sep 29, 2023

How DALL-E 2 Actually Works

How does OpenAI's groundbreaking DALL-E 2 model actually work? Check out this detailed guide to learn the ins and outs of DALL-E 2.

Ryan O'Connor

Developer Educator

Retrieval Augmented Generation on audio data with LangChain and Chroma

Tutorials

Sep 26, 2023

Retrieval Augmented Generation on audio data with LangChain and Chroma

Retrieval Augmented Generation (RAG) allows you to add relevant documents as context when querying LLMs. Learn how to perform RAG on audio data using LangChain and Chroma in this tutorial.

Ryan O'Connor

Developer Educator

How to get Zoom Transcripts with the Zoom API

Tutorials

Sep 14, 2023

How to get Zoom Transcripts with the Zoom API

In this tutorial, we'll learn how to get Zoom transcripts using the Zoom API using Python.

Ryan O'Connor

Developer Educator

Convert Speech to Text in Python in 5 Minutes

Tutorials

Sep 6, 2023

Convert Speech to Text in Python in 5 Minutes

Learn how to perform Automatic Speech Recognition in 5 minutes using Python and the AssemblyAI Speech-to-Text API with this simple tutorial.

Ryan O'Connor

Developer Educator

How to build an interactive lecture summarization app

Tutorials

Aug 31, 2023

How to build an interactive lecture summarization app

In this tutorial, we’ll learn how to build an application that automatically summarizes a lecture and lets you ask questions about the lecture material.

Ryan O'Connor

Developer Educator

RLHF vs RLAIF for language model alignment

Deep Learning

Aug 22, 2023

RLHF vs RLAIF for language model alignment

RLHF is the key method used to train AI assistants like ChatGPT, but it has strong limitations and can produce harmful outputs. RLAIF improves upon RLHF by using AI feedback. Learn the differences between the two methods and what these differences mean in practice in this guide.

Ryan O'Connor

Developer Educator

Automatic summarization with LLMs in Python

Tutorials

Aug 15, 2023

Automatic summarization with LLMs in Python

Learn how to perform automatic summarization with Python using LLMs in this easy-to-follow tutorial.

Ryan O'Connor

Developer Educator

How Reinforcement Learning from AI Feedback works

Deep Learning

Aug 1, 2023

How Reinforcement Learning from AI Feedback works

Reinforcement Learning from AI Feedback (RLAIF) is a supervision technique that uses a "constitution" to make AI assistants like ChatGPT safer. Learn everything you need to know about RLAIF in this guide.

Ryan O'Connor

Developer Educator

How to evaluate Speech Recognition models

Deep Learning

Jun 15, 2023

How to evaluate Speech Recognition models

Speech Recognition models are key in extracting useful information from audio data. Learn how to properly evaluate speech recognition models in this easy-to-follow guide.

Ryan O'Connor

Developer Educator

Introduction to Large Language Models for Generative AI

Popular

May 17, 2023

Introduction to Large Language Models for Generative AI

Generative AI language models like ChatGPT are changing the way humans and AI interact and work together, but how do these models actually work? Learn everything you need to know about modern Generative AI for language in this simple guide.

Ryan O'Connor

Developer Educator

Deep Learning

May 10, 2023

Modern Generative AI for images

Modern Generative AI models for images are powering a range of creative applications and changing the way we work. This guide will overview everything you need to know about these models and how they work.

Ryan O'Connor

Developer Educator

Deep Learning

May 2, 2023

Introduction to Generative AI

Generative AI has made tremendous strides recently, from models like Stable Diffusion to ChatGPT. Get up to speed on the latest advancements with this easy-to-follow introduction to Generative AI.

Ryan O'Connor

Developer Educator

Everything you need to know about Generative AI

Deep Learning

May 2, 2023

Everything you need to know about Generative AI

Generative AI has taken the world by storm in the last several months, but what actually is Generative AI, and how does it work? Learn everything you need to know about Generative AI in this easy-to-follow series.

Ryan O'Connor, Marco Ramponi

Developer Educator, Developer Educator