Episode
Baseten CEO and co-founder Tuhin Srivastava on inference and feedback loops
- Podcast
- Scaling DevTools
- Published
- Nov 14, 2025
- Duration seconds
- 1451
- Processing state
processed
Actions
POST https://stenobird.com/v1/public/podcasts/scaling-devtools/episodes/baseten-ceo-and-co-founder-tuhin-srivastava-on-inference-and-feedback-loops/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/scaling-devtools/baseten-ceo-and-co-founder-tuhin-srivastava-on-inference-and-feedback-loops.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
Baseten CEO Tuhin Srivastava discusses the strategic shift from building ML primitives to scaling a high-growth GenAI inference platform. He explores how to navigate rapid market shifts by prioritizing developer experience and tight customer feedback loops.
Topics
- GenAI
- Inference Infrastructure
- Developer Experience
- Machine Learning
- DevTools
- Scaling Startups
- Cloud Computing
- Software Engineering
Highlights
- Main idea: As model capabilities commoditize, the real value shifts to the underlying inference infrastructure and developer experience
- Practical takeaway: Minimize the distance between you and your customers to create ultra-fast feedback loops
- Failure mode: Avoid making irreversible early architectural abstractions before you have fully honed the product's developer experience
- Strategic insight: The future of enterprise software lies in being either AI-first or AI-enabled, driving massive demand for inference-centric tools
- Product philosophy: Focus on making the easy things easy and the hard things possible without blocking the developer's workflow
Chapters
1:00The Pre-GenAI Build Phase: Reflecting on the years spent building machine learning primitives before the massive market demand arrived in 2023.2:45Navigating Market Shifts: How Baseten repositioned itself to meet the sudden explosion of interest in generative AI.8:05The Power of Feedback Loops: Why small companies must prioritize direct, rapid communication with customers to iterate effectively.11:40The Future of Inference: Discussing why every company will eventually need inference-based infrastructure as models become commoditized.15:20Prioritizing Developer Experience: The importance of building tools that provide observability and ease of use without creating friction.20:45Advice for DevTool Founders: Lessons on managing early technical decisions and the importance of staying close to the user.