# How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765

Page: https://stenobird.com/podcast/twiml-ai-podcast/how-capital-one-delivers-multi-agent-systems-with-rashmi-shetty-765
Text version: https://stenobird.com/podcast/twiml-ai-podcast/how-capital-one-delivers-multi-agent-systems-with-rashmi-shetty-765.md
Podcast: [The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)](https://stenobird.com/podcast/twiml-ai-podcast)
Published: 2026-04-16T23:48:00+00:00
Episode link: https://twimlai.com/podcast/twimlai/how-capital-one-delivers-multi-agent-systems
Audio file: https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN9114691307.mp3?updated=1776383902
Processing state: processed
JSON: https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/how-capital-one-delivers-multi-agent-systems-with-rashmi-shetty-765
Duration seconds: 3258

## Resource

Capital One scales multi-agent systems in highly regulated environments by separating agent design from runtime governance. The discussion explores how their 'Chat Concierge' uses specialized models and robust observability to execute complex, goal-oriented actions.

## Highlights
- Main idea: Multi-agent systems are essential for breaking down complex, multifaceted problems into specific, goal-oriented actions
- Practical takeaway: Use a platform-centric approach to separate agent design from runtime governance, embedding guardrails and cyber controls at agent boundaries
- Failure mode: Avoid treating agents as isolated units; instead, evaluate them using end-to-end frameworks that account for the entire stochastic workflow
- Strategy: Leverage model distillation and fine-tuning to achieve the necessary balance between reasoning capabilities and low-latency performance
- Technical takeaway: Implement a robust observability stack and pluggable SDKs to allow the platform to evolve alongside rapidly advancing LLM capabilities

## Topics

Multi-Agent Systems, Generative AI Platform, Enterprise AI Governance, Model Distillation, AI Observability, LLM Evaluation, Agentic Workflows, Cloud-Native AI

## Chapters
- 1:05 — The Evolution of Intelligence: Rashmi discusses her transition from academic research in distributed intelligence to operationalizing agentic AI in the enterprise.
- 5:10 — The Shift to Multi-Agentic Workflows: An exploration of why complex problems require moving beyond simple LLM responses toward systems capable of taking specific actions.
- 9:55 — Operating in Regulated Environments: How Capital One manages the tension between deploying cutting-edge agentic technology and maintaining strict regulatory compliance.
- 17:20 — Platform Architecture and Data Lineage: The technical challenges of managing context, memory, and tool integration across multiple agents without exhausting context windows.
- 21:10 — Evaluation and Golden Datasets: Moving from individual agent evaluations to end-to-end evaluation frameworks for complex, multi-step workflows.
- 25:30 — Designing for Future Scalability: How a robust observability stack and pluggable SDKs allow for the seamless integration of new AI components and models.
- 29:45 — Model Specialization and Distillation: Using fine-tuning and student-teacher distillation to optimize for reasoning, latency, and personalized customer experiences.
- 45:55 — Treating Agentic AI as a System: Final lessons on the importance of treating agentic workflows as integrated systems rather than a collection of disparate tools.

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/how-capital-one-delivers-multi-agent-systems-with-rashmi-shetty-765/transcription-requests` — Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/twiml-ai-podcast/how-capital-one-delivers-multi-agent-systems-with-rashmi-shetty-765.md` — Read the agent-friendly Markdown representation of this episode resource.

A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.