{"podcast":{"title":"MLOps.community","slug":"mlops-community","podcast_index_feed_id":28679,"rss_url":"https://anchor.fm/s/174cb1b8/podcast/rss","website_url":"https://mlops.community","image_url":"https://d3t3ozftmdmh3i.cloudfront.net/production/podcast_uploaded_nologo/3809022/3809022-1612190855115-e91f8b881173f.jpg","author":"Demetrios","episode_count":516,"summary":"Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)","last_synced_at":null,"page_url":"https://stenobird.com/podcast/mlops-community"},"episode":{"title":"Durable Execution and Modern Distributed Systems","slug":"durable-execution-and-modern-distributed-systems","published_at":"2026-03-17T17:00:36+00:00","page_url":"https://stenobird.com/podcast/mlops-community/durable-execution-and-modern-distributed-systems","show_page_url":"https://stenobird.com/podcast/mlops-community","url":"https://podcasters.spotify.com/pod/show/mlops/episodes/Durable-Execution-and-Modern-Distributed-Systems-e3giukm","audio_url":"https://anchor.fm/s/174cb1b8/podcast/play/117061718/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2026-2-17%2F420203925-44100-2-919e18cb57386.mp3","summary":"Durable execution provides a new paradigm for building reliable, long-running applications by making regular code crash-proof. This approach allows developers to manage complex, stateful workflows—including LLM-driven agents—without manually handling distributed system failures.","meta_description":"Learn how Durable Execution and Temporal enable reliable, scalable, and crash-proof distributed systems for modern AI agents and data pipelines.","key_points":["Main idea: Durable execution abstracts away the complexity of distributed systems, ensuring code runs to completion despite server or API failures","Practical takeaway: Developers can use standard programming models (like Python's async/await) to build robust, stateful agentic workflows","Failure mode: Traditional data pipelines often struggle with reliability in the cloud; durable execution solves this by separating business logic from reliability concerns","Technical advantage: The model supports complex interactions through signals, updates, and queries, allowing real-time manipulation of running workflows","Future trend: The convergence of durable execution and LLMs enables a new class of autonomous agents that can interact with the world reliably over long periods"],"chapters":[{"start_ms":60000,"title":"The Core of Durable Execution","summary":"An introduction to making software crash-proof by ensuring programs run to completion regardless of cloud-native failures like flaky servers or API outages."},{"start_ms":355000,"title":"Reliability and Regional Resilience","summary":"Exploring how durable execution provides a higher level of reliability, even during major cloud provider outages or regional failures."},{"start_ms":610000,"title":"Managing State in Workflows","summary":"A look at how workflows maintain state and evolve as they interact with external tools and LLMs."},{"start_ms":1155000,"title":"Platform Engineering and Productivity","summary":"How platform teams use durable execution to provide standardized, reliable infrastructure that accelerates developer productivity."},{"start_ms":1435000,"title":"Building Agentic Systems","summary":"Discussing the increasing complexity and necessity of durable execution when building autonomous AI agents."},{"start_ms":1995000,"title":"Interacting with Running Workflows","summary":"How to use primitives like signals and queries to monitor and interact with active agent processes."},{"start_ms":3095000,"title":"The Evolution of Serverless","summary":"Comparing the shift from the serverless hype to the practical necessity of durable, stateful execution in modern infrastructure."}],"topics":["Durable Execution","Distributed Systems","AI Agents","LLM Workflows","Temporal","Cloud Reliability","Software Engineering","Platform Engineering"],"duration_seconds":3636,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/mlops-community/episodes/durable-execution-and-modern-distributed-systems/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/mlops-community/durable-execution-and-modern-distributed-systems.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}