Episode
#528: Python apps with LLM building blocks
- Podcast
- Talk Python To Me
- Published
- Nov 30, 2025
- Duration seconds
- 4606
- Processing state
processed
Actions
POST https://stenobird.com/v1/public/podcasts/talk-python-to-me/episodes/528-python-apps-with-llm-building-blocks/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/talk-python-to-me/528-python-apps-with-llm-building-blocks.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
Treat Large Language Models as standard, decoupled APIs within your Python architecture rather than magic black boxes. Learn how to implement clear boundaries, focused endpoints, and robust monitoring to build reliable AI-integrated applications.
Topics
- Python
- LLM
- Software Architecture
- API Design
- Machine Learning
- Caching
- Data Engineering
- AI Integration
Highlights
- Main idea: Treat LLM calls as standard API dependencies with defined boundaries and small, focused endpoints
- Practical takeaway: Use caching and inspection patterns to manage costs and verify the reliability of non-deterministic responses
- Failure mode: Avoid treating LLMs as a replacement for structured logic; instead, use them as building blocks within a larger, typed system
- Practical takeaway: Implement monitoring and evaluations to ensure LLM performance meets application requirements
- Main idea: Leverage abstraction layers to switch between different providers like OpenAI and Anthropic without rewriting core logic
Chapters
6:45The AI Hype Landscape: A discussion on the current state of AI hype and the impact of LLM advancements on the developer ecosystem.12:25Software Engineering for Data Science: Exploring how software engineering best practices can bring reliability to data science and machine learning workflows.24:05LLMs as Modular Building Blocks: Architectural strategies for treating LLM APIs as interchangeable components regardless of the provider.29:50Caching and Storage Strategies: Using local storage and SQLite for caching LLM responses to improve performance and reduce latency.41:50Abstraction Layers and Tooling: Using libraries to abstract across different LLM providers like Anthropic and OpenAI.53:10Structured Output and Type Safety: Leveraging modern LLM capabilities for structured tasks and the importance of using types in Python.1:10:50Recommended Tools and Final Advice: A roundup of useful libraries and tools for building production-ready Python applications with LLMs.