# Voice Agent Use Cases Page: https://stenobird.com/podcast/mlops-community/voice-agent-use-cases Text version: https://stenobird.com/podcast/mlops-community/voice-agent-use-cases.md Podcast: [MLOps.community](https://stenobird.com/podcast/mlops-community) Published: 2026-05-01T17:00:00+00:00 Episode link: https://podcasters.spotify.com/pod/show/mlops/episodes/Voice-Agent-Use-Cases-e3ileg6 Audio file: https://anchor.fm/s/174cb1b8/podcast/play/119240646/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2026-3-29%2F423136952-44100-2-d67b1521393a6.mp3 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/mlops-community/episodes/voice-agent-use-cases Duration seconds: 3064 ## Resource This episode is brought to you by the MLflow team. Check out more information at MLflow.org . What does it actually take to build voice AI at a billion-interaction scale? This episode features an ex-Amazon voice AI engineer who built customer support systems handling 2 billion+ interactions β€” now working on next-gen voice agent platforms. Anurag digs deep into the real engineering tradeoffs, design patterns, and use cases that separate production-grade voice agents from demos. Voice Agent Use Cases // MLOps Podcast #372 with Anurag Beniwal, Member of the Technical Staff at ElevenLabs πŸŽ™οΈ Topics covered: πŸ”Ή Cascaded vs. speech-to-speech β€” Why cascaded systems still win in production, and how to make them feel natural without sacrificing control πŸ”Ή Latency masking β€” Foreground/background model architecture and how to buy yourself time while deep retrieval runs πŸ”Ή Constellation of models β€” Using Haiku for tool calling, fine-tuned smaller models for response generation, and why "one model for everything" breaks at scale πŸ”Ή Turn-taking & ASR challenges β€” Why voice is harder than chat: accents, noise, silence detection, and domain-specific fine-tuning πŸ”Ή Level 1 vs Level 2 customer support β€” Why today's agents max out at Level 1 and what it takes to capture Level 2 expert judgment πŸ”Ή Inbound vs. outbound sales agents β€” Where voice agents are already winning, and why inbound lead qualification beats cold outbound πŸ”Ή Booking, reservations & concierge β€” The clearest near-term wins for voice agents across hospitality, home services, and SMBs πŸ”Ή Continual learning from natural language feedback β€” How to build agents that improve from real operator feedback without ML expertise πŸ”Ή Conversational TTS β€” Why passing full conversation history to your TTS model changes everythi… ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/mlops-community/episodes/voice-agent-use-cases/transcription-requests` β€” Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/mlops-community/voice-agent-use-cases.md` β€” Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.