customers
All customer stories
Top Voice AI companies are building with Assembly.
resources
Latest Release
Universal 3.5 Pro Realtime
The first streaming speech-to-text model that takes the agent's question as input.
resources
Both pass through provider rates at 0% markup. Here’s what AssemblyAI’s LLM Gateway adds for Voice AI:
Transcribe with AssemblyAI’s own speech models and apply any LLM to the result in one pipeline. Helicone observes text LLM traffic only—it has no native speech or audio.
Like Helicone’s gateway, AssemblyAI passes through provider token rates with no markup—and observability isn’t gated behind a separate paid tier.
Configure a primary model and any number of backups per request. If a provider errors or rate-limits, the Gateway retries the next model—no code changes.
GPT, Claude, Gemini, Qwen, Mistral, and more behind one OpenAI-compatible API. New models are added the day they launch.
Drop into any OpenAI SDK or HTTP client. Change a base URL and a model string, and the rest of your code keeps working.
Route requests through EU-resident infrastructure, on by default for EU traffic—not just a region you select yourself.
Opt into zero data retention per request or project-wide, so prompts and responses are never stored.
Sign a Business Associate Addendum for applications that process PHI—without gating it behind a premium observability tier.
Get your free API key and make your first Gateway call in minutes—no commitments or minimums.
The same transcript, four different jobs, four different models — all routed through one endpoint.
What is runner's knee?
Based on the transcript, runner's knee is a condition characterized by pain behind or around the kneecap. It is caused by overuse, muscle imbalance and inadequate stretching. Symptoms include pain under or around the kneecap and pain when walking.
Summarize this call in three bullet points
• Caller reports persistent knee pain after long runs. • Clinician identifies likely runner's knee from overuse and weak hip stabilizers. • Recommended next steps: rest, ice, targeted glute and quad strengthening.
Extract symptoms, conditions, and recommended treatments as JSON.
{ "condition": "runner's knee", "symptoms": ["kneecap pain", "pain when walking"], "causes": ["overuse", "muscle imbalance", "inadequate stretching"], "treatments": ["rest", "ice", "strengthening"] }
Write a 2-sentence patient follow-up email summarizing the visit.
Hi — thanks for coming in today. As we discussed, your knee pain is consistent with patellofemoral pain syndrome (runner's knee), and I'd recommend two weeks of reduced mileage along with the strengthening routine I sent over.
Put our Voice AI models and the LLM Gateway to the test in our no-code playground.