customers
All customer stories
Top Voice AI companies are building with Assembly.
resources
Latest Release
Universal 3.5 Pro Realtime
The first streaming speech-to-text model that takes the agent's question as input.
resources
Both pass through provider rates at 0% markup. Here’s what AssemblyAI’s LLM Gateway adds for Voice AI:
Transcribe with AssemblyAI’s own speech models and apply any LLM to the result in one pipeline. Vercel AI Gateway routes text, image, and video—but not speech.
Like Vercel, AssemblyAI passes through provider token rates with no gateway markup—so you can lead with capabilities, not cost.
Configure a primary model and any number of backups per request. If a provider errors or rate-limits, the Gateway retries the next model—no code changes.
GPT, Claude, Gemini, Qwen, Mistral, and more behind one OpenAI-compatible API. New models are added the day they launch.
Drop into any OpenAI SDK or HTTP client—no framework lock-in. Change a base URL and a model string, and the rest of your code keeps working.
Route requests through EU-resident infrastructure, on by default for EU traffic—not a team-wide add-on you have to enable.
Opt into zero data retention per request or project-wide, so prompts and responses are never stored.
Sign a Business Associate Addendum for applications that process PHI, backed by SOC 2 Type 2, ISO 27001, PCI DSS, and GDPR.
Get your free API key and make your first Gateway call in minutes—no framework required.
The same transcript, four different jobs, four different models — all routed through one endpoint.
What is runner's knee?
Based on the transcript, runner's knee is a condition characterized by pain behind or around the kneecap. It is caused by overuse, muscle imbalance and inadequate stretching. Symptoms include pain under or around the kneecap and pain when walking.
Summarize this call in three bullet points
• Caller reports persistent knee pain after long runs. • Clinician identifies likely runner's knee from overuse and weak hip stabilizers. • Recommended next steps: rest, ice, targeted glute and quad strengthening.
Extract symptoms, conditions, and recommended treatments as JSON.
{ "condition": "runner's knee", "symptoms": ["kneecap pain", "pain when walking"], "causes": ["overuse", "muscle imbalance", "inadequate stretching"], "treatments": ["rest", "ice", "strengthening"] }
Write a 2-sentence patient follow-up email summarizing the visit.
Hi — thanks for coming in today. As we discussed, your knee pain is consistent with patellofemoral pain syndrome (runner's knee), and I'd recommend two weeks of reduced mileage along with the strengthening routine I sent over.
Put our Voice AI models and the LLM Gateway to the test in our no-code playground.