# Crafting Data Solutions: Shrinking Pie and Leveraging Insights for Optimal Data Learning - ML 176 Page: https://stenobird.com/podcast/adventures-in-machine-learning/crafting-data-solutions-shrinking-pie-and-leveraging-insights-for-optimal-data-learning-ml-176 Text version: https://stenobird.com/podcast/adventures-in-machine-learning/crafting-data-solutions-shrinking-pie-and-leveraging-insights-for-optimal-data-learning-ml-176.md Podcast: [Adventures in Machine Learning](https://stenobird.com/podcast/adventures-in-machine-learning) Published: 2024-11-28T11:00:00+00:00 Episode link: https://www.spreaker.com/episode/crafting-data-solutions-shrinking-pie-and-leveraging-insights-for-optimal-data-learning-ml-176--63122287 Audio file: https://dts.podtrac.com/redirect.mp3/api.spreaker.com/download/episode/63122287/ml_176.mp3 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/adventures-in-machine-learning/episodes/crafting-data-solutions-shrinking-pie-and-leveraging-insights-for-optimal-data-learning-ml-176 Duration seconds: 3343 ## Resource As data growth outpaces Moore's Law, traditional database performance is becoming unsustainable. Barzan Mozafari explains how automated cloud optimization and query rewriting can bridge this gap and reclaim wasted infrastructure spend. ## Highlights - Main idea: Data growth is currently outpacing hardware improvements, creating a performance gap that requires intelligent automation rather than just more hardware - Practical takeaway: Use automated workload intelligence to optimize existing data stacks like Snowflake without needing to migrate platforms - Failure mode: Relying on manual infrastructure management leads to exponential cost increases as data volumes scale - Lesson: The 'fail fast' mentality of academic research—testing ideas through rapid experimentation—is highly effective for B2B software development - Future trend: Large Language Models (LLMs) are being applied to query rewriting to significantly enhance database efficiency ## Topics Cloud Optimization, Data Engineering, Snowflake, Query Rewriting, Infrastructure Costs, Machine Learning, Database Performance, Automation ## Chapters - 1:05 — The Crisis of Data Growth: Barzan Mozafari discusses why the divergence between data volume growth and Moore's Law makes traditional database scaling unsustainable. - 5:45 — The Keebo Business Model: An exploration of the incentive structures in cloud optimization and how Keebo aligns its success with customer cost savings. - 14:45 — Avoiding Platform Lock-in: Why modern optimization tools should work with your existing data stack rather than forcing expensive migrations to new platforms. - 23:50 — Unlocking Business Value: How reducing infrastructure overhead allows engineering teams to redirect resources toward core product innovation and business growth. - 33:00 — Applying Academic Rigor to Industry: The benefits of bringing research-driven 'fail fast' methodologies and deep problem-solving skills into the commercial software lifecycle. - 47:30 — The Future of Query Optimization: A look at the potential of LLMs in query rewriting and the challenges of managing expectations around AI agents in data engineering. ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/adventures-in-machine-learning/episodes/crafting-data-solutions-shrinking-pie-and-leveraging-insights-for-optimal-data-learning-ml-176/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/adventures-in-machine-learning/crafting-data-solutions-shrinking-pie-and-leveraging-insights-for-optimal-data-learning-ml-176.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.