Deep Learning - News, Tutorials, AI Research

Decoding Strategies: How LLMs Choose The Next Word

Deep Learning

Aug 21, 2024

Decoding Strategies: How LLMs Choose The Next Word

Large Language Models are trained to guess the next word. But when generating text, the combination of their probability estimates with algorithms known as decoding strategies is what determines how they actually choose words. Learn how decoding strategies work in this article.

Marco Ramponi

Developer Educator

Florence-2: How it works and how to use it

Deep Learning

Jul 15, 2024

Florence-2: How it works and how to use it

Microsoft's Florence-2 is a foundational image model that can perform almost every common task in computer vision. Learn how Florence-2 works and how to use it in this guide.

Ryan O'Connor

Developer Educator

AI trends in 2024: Graph Neural Networks

Deep Learning

Feb 20, 2024

AI trends in 2024: Graph Neural Networks

From fundamental research to productionized AI models, let’s discover how this cutting-edge technology is powering production applications and may be shaping the future of AI.

Marco Ramponi

Developer Educator

AI for Universal Audio Understanding: Qwen-Audio Explained

Deep Learning

Dec 7, 2023

AI for Universal Audio Understanding: Qwen-Audio Explained

Recently, researchers have made progress towards universal audio understanding, marking an advancement towards foundational audio models. The approach is based on a joint audio-language pre-training that enhances performance without task-specific finetuning.

Marco Ramponi

Developer Educator

Combining Speech Recognition and Diarization in one model

Deep Learning

Oct 27, 2023

Combining Speech Recognition and Diarization in one model

A new approach towards multi-speaker speech processing integrates Speaker Diarization and Automatic Speech Recognition in a unified framework. We discuss the key insights from this recent exciting development in Speech AI research.

Marco Ramponi

Developer Educator

Deep Learning

Sep 29, 2023

How DALL-E 2 Actually Works

How does OpenAI's groundbreaking DALL-E 2 model actually work? Check out this detailed guide to learn the ins and outs of DALL-E 2.

Ryan O'Connor

Developer Educator

What AI Music Generators Can Do (And How They Do It)

Deep Learning

Sep 22, 2023

What AI Music Generators Can Do (And How They Do It)

Text-to-Music Models are advancing rapidly with the recent release of new platforms for AI-generated music. This guide focuses on MusicLM, MusicGen, and Stable Audio, exploring the technical breakthroughs and challenges in creating music with AI.

Marco Ramponi

Developer Educator

Deep Learning

Sep 5, 2023

Is Word Error Rate Useful?

What is Word Error Rate and is it a useful measurement of accuracy for speech recognition systems? In this article, we examine the answer to these questions, as well as explore other alternatives to Word Error Rate.

Dylan Fox

Founder, CEO

Residual Vector Quantization RVQ for Neural Compression

Deep Learning

Sep 4, 2023

What is Residual Vector Quantization?

Neural Audio Compression methods based on Residual Vector Quantization are reshaping the landscape of modern audio codecs. In this guide, learn the basic ideas behind RVQ and how it enhances Neural Compression.

Marco Ramponi

Developer Educator

RLHF vs RLAIF for language model alignment

Deep Learning

Aug 22, 2023

RLHF vs RLAIF for language model alignment

RLHF is the key method used to train AI assistants like ChatGPT, but it has strong limitations and can produce harmful outputs. RLAIF improves upon RLHF by using AI feedback. Learn the differences between the two methods and what these differences mean in practice in this guide.

Ryan O'Connor

Developer Educator

Why Language Models Became Large Language Models And The Hurdles In Developing LLM-based Applications

Deep Learning

Aug 18, 2023

Why Language Models Became Large Language Models And The Hurdles In Developing LLM-based Applications

What’s the difference between Language Models and Large Language Models? Let’s understand AI development trends and the difficulties of integrating LLMs into real-world applications.

Marco Ramponi

Developer Educator

How RLHF Models Works - Reinforcement Learning From Human Feedback

Deep Learning

Aug 3, 2023

How RLHF Preference Model Tuning Works (And How Things May Go Wrong)

Large Language Models like ChatGPT are trained with Reinforcement Learning From Human Feedback (RLHF) to learn human preferences. Let’s uncover how RLHF works and survey its current strongest limitations.

Marco Ramponi

Developer Educator

How Reinforcement Learning from AI Feedback works

Deep Learning

Aug 1, 2023

How Reinforcement Learning from AI Feedback works

Reinforcement Learning from AI Feedback (RLAIF) is a supervision technique that uses a "constitution" to make AI assistants like ChatGPT safer. Learn everything you need to know about RLAIF in this guide.

Ryan O'Connor

Developer Educator

Recent developments in Generative AI for Audio

Deep Learning

Jun 27, 2023

Recent developments in Generative AI for Audio

The spotlight has been on language and images for Generative AI, but there's been a lot of recent progress in the audio domain. Learn everything you need to know about generative audio models in this article.

Marco Ramponi

Developer Educator

How to evaluate Speech Recognition models

Deep Learning

Jun 15, 2023

How to evaluate Speech Recognition models

Speech Recognition models are key in extracting useful information from audio data. Learn how to properly evaluate speech recognition models in this easy-to-follow guide.

Ryan O'Connor

Developer Educator

Large Language Models for Product Managers: 5 Things to Know

Deep Learning

May 23, 2023

Large Language Models for Product Managers: 5 Things to Know

A Product Manager's guide to understanding Large Language Models and the building blocks of Conversational AI.

Marco Ramponi

Developer Educator

Introduction to Large Language Models for Generative AI

Popular

May 17, 2023

Introduction to Large Language Models for Generative AI

Generative AI language models like ChatGPT are changing the way humans and AI interact and work together, but how do these models actually work? Learn everything you need to know about modern Generative AI for language in this simple guide.

Ryan O'Connor

Developer Educator

Deep Learning

May 10, 2023

Modern Generative AI for images

Modern Generative AI models for images are powering a range of creative applications and changing the way we work. This guide will overview everything you need to know about these models and how they work.

Ryan O'Connor

Developer Educator

The Full Story of Large Language Models and RLHF

Deep Learning

May 3, 2023

The Full Story of Large Language Models and RLHF

Large Language Models have been in the limelight since the release of ChatGPT, with new models being announced seemingly every week. This guide walks through the essential ideas of how these models came to be.

Marco Ramponi

Developer Educator

Deep Learning

May 2, 2023

Introduction to Generative AI

Generative AI has made tremendous strides recently, from models like Stable Diffusion to ChatGPT. Get up to speed on the latest advancements with this easy-to-follow introduction to Generative AI.

Ryan O'Connor

Developer Educator

Everything you need to know about Generative AI

Deep Learning

May 2, 2023

Everything you need to know about Generative AI

Generative AI has taken the world by storm in the last several months, but what actually is Generative AI, and how does it work? Learn everything you need to know about Generative AI in this easy-to-follow series.

Ryan O'Connor, Marco Ramponi

Developer Educator, Developer Educator

Deep Learning

Apr 19, 2023

How physics advanced Generative AI

Many cutting-edge Generative AI models are inspired by concepts from physics. In this guide, we’ll take a high-level look at how physics is driving advancements in AI.

Ryan O'Connor

Developer Educator

Emergent Abilities of Large Language Models

Deep Learning

Mar 7, 2023

Emergent Abilities of Large Language Models

Emergence can be defined as the sudden appearance of novel behavior. Large Language Models apparently display emergence by suddenly gaining new abilities as they grow. Why does this happen, and what does this mean?

Ryan O'Connor

Developer Educator

AI research review – Locating and Editing Factual Associations in GPT

Deep Learning

Jan 18, 2023

AI research review – Locating and Editing Factual Associations in GPT

This week’s AI Research Review is Locating and Editing Factual Associations in GPT.

Gabriel Oexle

Deep Learning Researcher

Deep Learning

Dec 23, 2022

How ChatGPT actually works

Since its release, the public has been playing with ChatGPT and seeing what it can do, but how does ChatGPT actually work? While the details of its inner workings have not been published, we can piece together its functioning principles from recent research.

Marco Ramponi

Developer Educator