Episode
How can you test your code when you don’t know what’s in it?
- Podcast
- The Stack Overflow Podcast
- Published
- Mar 31, 2026
- Duration seconds
- 1818
- Processing state
processed
Actions
POST https://stenobird.com/v1/public/podcasts/the-stack-overflow-podcast/episodes/how-can-you-test-your-code-when-you-don-t-know-what-s-in-it/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/the-stack-overflow-podcast/how-can-you-test-your-code-when-you-don-t-know-what-s-in-it.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
The shift toward Model Context Protocol (MCP) and LLM-driven agents introduces non-determinism that breaks traditional software testing. This discussion explores how developers can validate agentic workflows when the sequence of tool calls is decided on the fly by an AI.
Topics
- Model Context Protocol
- AI Agents
- Software Testing
- LLM-driven Development
- MCP Servers
- Agentic Workflows
- Non-determinism
- Software Architecture
Highlights
- Main idea: MCP is becoming the new foundational layer for AI agents, shifting the abstraction level from LLM prompts to tool invocation
- Failure mode: Relying on rigid, hard-coded workflows for AI agents defeats the purpose of the LLM's ability to decide the best path dynamically
- Practical takeaway: Testing must move toward validating outcomes and bounds rather than checking specific, hand-authored code paths
- Economic trade-off: AI-generated code may prioritize development velocity over performance, requiring a new pricing strategy to cover higher compute costs
- Future outlook: As source code becomes a commodity, the value in engineering will shift toward data construction and managing the complexity of agentic interactions
Chapters
1:00Guest Background: Fitz Nowlan discusses his journey from PhD research in distributed systems to leading AI architecture at SmartBear.3:20The Challenge of Non-deterministic Testing: The difficulty of testing MCP servers where the LLM decides the tool sequence on the fly, making rigid workflows impossible.7:50Prompt Engineering vs. Long-term Value: Why developers should avoid falling in love with specific prompts that may be rendered obsolete by newer, better models.12:35The Future of Unit Testing: A debate on whether traditional unit testing loses relevance when AI can effectively handle assertion-based testing.21:20Commoditization of CRUD Apps: How AI-driven development lowers margins for basic applications and shifts the focus toward more complex engineering problems.27:50MCP as the New Foundation: The transition from LLMs being the core focus to MCP providing the essential infrastructure for agentic tool use.