{"podcast":{"title":"The Data Exchange with Ben Lorica","slug":"the-data-exchange-with-ben-lorica","podcast_index_feed_id":1196000,"rss_url":"https://rss.buzzsprout.com/682433.rss","website_url":"https://thedataexchange.media/","image_url":"https://storage.buzzsprout.com/ljk0yj7r22pi61grsmelnsoa9084?.jpg","author":"Ben Lorica","episode_count":345,"summary":"A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].","last_synced_at":null,"page_url":"https://stenobird.com/podcast/the-data-exchange-with-ben-lorica"},"episode":{"title":"The Truth About Agents in Production","slug":"the-truth-about-agents-in-production","published_at":"2025-12-31T12:00:00+00:00","page_url":"https://stenobird.com/podcast/the-data-exchange-with-ben-lorica/the-truth-about-agents-in-production","show_page_url":"https://stenobird.com/podcast/the-data-exchange-with-ben-lorica","url":"https://dts.podtrac.com/redirect.mp3/www.buzzsprout.com/682433/episodes/18412003-the-truth-about-agents-in-production.mp3","audio_url":"https://dts.podtrac.com/redirect.mp3/www.buzzsprout.com/682433/episodes/18412003-the-truth-about-agents-in-production.mp3","summary":"A panel of industry leaders from Anthropic, LlamaIndex, Pydantic, and Arize AI discusses the transition from simple LLM prompts to complex agentic workflows. The discussion focuses on the practical engineering challenges of reliability, evaluation, and tool integration in production environments.","meta_description":"Experts from Anthropic, LlamaIndex, and Arize AI discuss the reality of deploying AI agents, focusing on evals, type safety, and computer use.","key_points":["Main idea: Successful agent deployment relies on translating specific business processes into workflows rather than forcing AI into existing structures","Practical takeaway: Implementing type safety in agent frameworks is critical for the reliability of coding agents","Failure mode: Over-reliance on offline evaluations can lead to a lack of visibility into real-world user friction and production errors","Main idea: The future of agents lies in 'computer use' and the ability to interact with unstructured interfaces where APIs do not exist","Practical takeaway: Using high-reasoning models for planning and delegating execution to faster, cheaper models can optimize agentic performance"],"chapters":[{"start_ms":60000,"title":"Architectural Patterns in Agents","summary":"The panel explores successful agent architectures, highlighting the importance of type safety in coding agents."},{"start_ms":170000,"title":"Product-Led AI Development","summary":"Discussion on why the best teams focus on solving user problems rather than simply implementing new AI capabilities."},{"start_ms":280000,"title":"The Challenge of Agent Planning","summary":"An analysis of the difficulties in managing context handoffs and planning across multi-agent systems."},{"start_ms":620000,"title":"The Role of Evaluations","summary":"A debate on the necessity of offline vs. online evaluations and the value of product analytics in measuring agent success."},{"start_ms":1060000,"title":"MCP and Computer Use","summary":"Exploring the Model Context Protocol (MCP) and the potential for agents to navigate software via direct computer interaction."},{"start_ms":1290000,"title":"The Future of Agent Interfaces","summary":"Predictions on standardized interfaces like SQL and the evolution of RAG into more active, tool-using search agents."}],"topics":["Agentic AI","LLM Evaluation","AI Observability","Computer Use","Model Context Protocol","RAG","Software Engineering","AI Infrastructure"],"duration_seconds":1537,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/the-data-exchange-with-ben-lorica/episodes/the-truth-about-agents-in-production/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/the-data-exchange-with-ben-lorica/the-truth-about-agents-in-production.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}