How CBB Labs enabled a dance studio to run 4× faster with AssemblyAI's Voice AI

CBB Labs built Sangs, a voice agent powered by AssemblyAI, for DanceHall Studios, a studio-management platform for South Asian classical dance schools. With Sangs, one studio now completes 4× more daily work and saves 6 hours weekly.

more studio actions per week after moving from text to voice

hours won back every week

90%

fewer dashboard clicks

The load around the teaching

Shivakami, director of Temple of Fine Arts in Texas, has been a classical dancer for over 30 years and has taught for more than a decade. Her studio serves close to 100 students across 13 courses, with students distributed globally.

All the operational work falls on her: class emails to every cohort, course setup, uploading practice videos individually, pausing and resuming student subscriptions, and managing monthly billing. Without dedicated administrative support, these tasks consume time she would otherwise spend with students.

From typing to talking

DanceHall's dashboard provided visibility into student progress, attendance, and revenue through a modern SaaS interface. Text-based agentic actions added a layer of convenience, but interacting through chat still required thinking, typing, and waiting. In many cases, the time investment was comparable to clicking through the dashboard directly.

When voice agentic actions launched, the dynamic shifted. Instead of opening dashboards or composing prompts, Shivakami could ask questions and issue instructions verbally: "Who hasn't paid this month?" "Send reminders to the Tuesday class." "How many students attended last week?"

Handling specialized vocabulary in speech-to-text

Bharatanatyam, one of the world's oldest continuously practiced classical dance traditions, originated in Tamil Nadu and draws from centuries of Sanskrit and South Indian musical traditions. Its technical vocabulary blends Tamil and Sanskrit terms. Teachers routinely use specialized words like thattadavu, nattadavu, tattimetti, alarippu, jatiswaram, and varnam in conversation. When speech-to-text mishandles these terms, the user must read the transcript, correct the error, and resume their flow. At that point, voice becomes a friction point rather than an efficiency gain.

CBB Labs evaluated several speech-to-text providers before selecting AssemblyAI for its ability to handle specialized terminology reliably. Shivakami can speak a full message in specialized vocabulary and receive an accurate transcription without stopping to re-correct. The accuracy continues improving as the system's knowledge base of her terminology grows. AssemblyAI handles the transcription layer while Sangs learns to interpret her world on top of it.

The result: she speaks, AssemblyAI transcribes accurately, and Sangs executes the intended action. Approximately 9 out of 10 voice requests now carry out the intended action correctly, making the system practical for daily use.

‍

What voice enabled

With voice, both the volume of work and the time required shifted dramatically. Shivakami now completes nearly 4× as much studio administration through voice as she previously did through chat, in approximately 10% of the time. Sending emails, creating courses, uploading materials, and managing student billing are all now handled by voice.

Consider the time difference: 15 minutes to type an email, 15 minutes to create a new course, and 30 minutes to add links via chat and specify their placement. That same work now takes under 5 minutes. Shivakami saves 6 hours of administrative work each week. She has shifted from operating software to delegating through voice, moving from tool use toward having an administrative assistant.

‍

Why voice adoption was immediate

The adoption curve for voice was nonexistent. Shivakami already runs her day through voice, sending students WhatsApp voice notes the same way she communicates with them in class. When the software began listening, there was no new behavior to learn. The interface finally matched the way she already worked. Text-based chat, by contrast, required her to translate her natural communication style into typing, creating friction. Voice simply let her talk.

Looking ahead

As CBB Labs extends voice agents like Sangs into additional workflows—including feedback, sign-ups, and onboarding—the team is strengthening evaluations to keep the system reliable as its scope grows. Over the next 1 to 3 years, DanceHall Studios is expected to become voice-first. The same administrative load appears across countless local service businesses, and as accuracy and evaluation rigor increase, voice-agentic automation will likely expand into customer management, finance, and operations across that entire category.

Start building what's next

Top Voice AI companies rely on AssemblyAI’s speech-to-text and speech understanding models to launch groundbreaking products fast and scale with ease.

Get started now

A partnership built on support and scale

Super

Super, a vertical Voice AI platform for real estate, switched to AssemblyAI's Universal-3 Pro Streaming to improve turn detection and keyterm accuracy for high-volume phone agents, and cut STT costs by ~30% in the process.

JotPsych

JotPsych introduces an ambient AI scribe designed specifically for behavioral health, engineered around clinical workflows instead of technical plumbing. Fast, accurate, and purpose-built for mental-health providers.

Nylas

Nylas, a leading developer API platform, chose integration over building in-house and integrated meeting intelligence capabilities in less than a day with AssemblyAI. Thousands of developers globally can now add meeting intelligence to their applications with a single line of code.

Unlock the value of voice data

Build what’s next on the platform powering thousands of the industry’s leading of Voice AI apps.

Try our API for free Contact sales