# Scott Haines on the Future of Data Engineering

- Page: https://stenobird.com/podcast/data-engineering-central-podcast-7106217/scott-haines-on-the-future-of-data-engineering
- Text version: https://stenobird.com/podcast/data-engineering-central-podcast-7106217/scott-haines-on-the-future-of-data-engineering.md
- Podcast: [Data Engineering Central Podcast](https://stenobird.com/podcast/data-engineering-central-podcast-7106217)
- Published: 2025-12-17T13:44:00+00:00
- Episode link: https://dataengineeringcentral.substack.com/p/scott-haines-on-the-future-of-data
- Audio file: https://api.substack.com/feed/podcast/181261013/09ad0a8eed94f4bda0e139df22ed9087.mp3
- Processing state: not_requested
- JSON: https://stenobird.com/v1/public/podcasts/data-engineering-central-podcast-7106217/episodes/scott-haines-on-the-future-of-data-engineering
- Duration seconds: 6660

## Resource

In this episode, I sit down with Scott Haines (O’Reilly author, Databricks MVP, and veteran of Yahoo, Nike, and Twilio) for a wide-ranging conversation on the real state of modern data engineering. We dig into open-source ecosystems, Lakehouse architectures, the evolution of Spark, streaming, what’s broken and what’s working in today’s data tooling, and the lessons Scott has learned scaling platforms at some of the biggest companies in the world.

If you care about data engineering, architecture, OSS, or the future of the modern data stack, you’ll love this one.

Thanks for reading Data Engineering Central! This post is public so feel free to share it.

Make sure to follow Scott here on Substack, and over on GitHub.

This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit dataengineeringcentral.substack.com/subscribe

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/data-engineering-central-podcast-7106217/episodes/scott-haines-on-the-future-of-data-engineering/transcription-requests`: Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/data-engineering-central-podcast-7106217/scott-haines-on-the-future-of-data-engineering.md`: Read the agent-friendly Markdown representation of this episode resource.

A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.
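
As a rough illustration of the two actions listed under Actions, the sketch below builds the corresponding HTTP requests with Python's standard library. Only the HTTP methods and URLs come from this page. The helper names (`build_request_transcript`, `build_read_markdown`, `send`), the empty POST body, and the absence of authentication headers are assumptions made for illustration; the page does not document a request body, required headers, or the response format.

```python
import urllib.request

# Values copied from the URLs on this page.
BASE = "https://stenobird.com"
PODCAST = "data-engineering-central-podcast-7106217"
EPISODE = "scott-haines-on-the-future-of-data-engineering"


def build_request_transcript() -> urllib.request.Request:
    """Build (but do not send) the POST for the request_transcript action."""
    url = (
        f"{BASE}/v1/public/podcasts/{PODCAST}"
        f"/episodes/{EPISODE}/transcription-requests"
    )
    # Assumption: an empty body and no auth header are acceptable.
    return urllib.request.Request(url, data=b"", method="POST")


def build_read_markdown() -> urllib.request.Request:
    """Build (but do not send) the GET for the read_markdown action."""
    url = f"{BASE}/podcast/{PODCAST}/{EPISODE}.md"
    return urllib.request.Request(url, method="GET")


def send(req: urllib.request.Request, timeout: float = 10.0) -> tuple[int, str]:
    """Send a prepared request and return (status code, decoded body)."""
    # Hypothetical helper: the response schema is not described on this page,
    # so the body is returned as plain text without further parsing.
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.status, resp.read().decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Print the requests instead of sending them, so running this file
    # has no side effects on the service.
    for req in (build_request_transcript(), build_read_markdown()):
        print(req.get_method(), req.full_url)
```

Because the page describes `request_transcript` as idempotent, repeating the POST should be safe, though how the service signals an already-queued request is not stated here. Since a page view does not enqueue transcription, an agent that needs the episode processed would call `send(build_request_transcript())` explicitly.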