# [State of AI Startups] Memory/Learning, RL Envs & DBT-Fivetran — Sarah Catanzaro, Amplify

- Page: https://stenobird.com/podcast/latent-space-ai-engineer/state-of-ai-startups-memory-learning-rl-envs-dbt-fivetran-sarah-catanzaro-amplify
- Text version: https://stenobird.com/podcast/latent-space-ai-engineer/state-of-ai-startups-memory-learning-rl-envs-dbt-fivetran-sarah-catanzaro-amplify.md
- Podcast: [Latent Space: The AI Engineer Podcast](https://stenobird.com/podcast/latent-space-ai-engineer)
- Published: 2025-12-30T14:00:00+00:00
- Episode link: https://www.latent.space/p/state-of-ai-startups-memorylearning
- Audio file: https://api.substack.com/feed/podcast/186610557/c925c47e33b1d08b87213416fdb3b3b8.mp3
- Processing state: processed
- JSON: https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/state-of-ai-startups-memory-learning-rl-envs-dbt-fivetran-sarah-catanzaro-amplify
- Duration seconds: 1722

## Resource

From investing through the modern data stack era (DBT, Fivetran, and the analytics explosion) to now investing at the frontier of AI infrastructure and applications at Amplify Partners, Sarah Catanzaro has spent years at the intersection of data, compute, and intelligence, watching categories emerge, merge, and occasionally disappoint.

We caught up with Sarah live at NeurIPS 2025 to dig into the state of AI startups heading into 2026: why $100M+ seed rounds with no near-term roadmap are now the norm (and why that terrifies her), what the DBT-Fivetran merger really signals about the modern data stack (spoiler: it’s not dead, just ready for IPO), how frontier labs are using DBT and Fivetran to manage training data and agent analytics at scale, why data catalogs failed as standalone products but might succeed as metadata services for agents, the consumerization of AI and why personalization (memory, continual learning, K-factor) is the 2026 unlock for retention and growth, why she thinks RL environments are a fad and real-world logs beat synthetic clones every time, and her thesis for the most exciting AI startups: companies that marry hard research problems (RAG, rule-following, continual learning) with killer applications that were simply impossible before.

We discuss:

* The DBT-Fivetran merger: not the death of the modern data stack, but a path to IPO scale (targeting $600M+ combined revenue) and a signal that both companies were already winning their categories
* How frontier labs use data infrastructure: DBT and Fivetran for training data curation, agent analytics, and managing increasingly complex interactions, plus the rise of transactional databases (RocksDB) and efficient data loading (Vortex) for GPU-bound workloads
* Why data catalogs failed: built for humans…

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/state-of-ai-startups-memory-learning-rl-envs-dbt-fivetran-sarah-catanzaro-amplify/transcription-requests`. Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/latent-space-ai-engineer/state-of-ai-startups-memory-learning-rl-envs-dbt-fivetran-sarah-catanzaro-amplify.md`. Read the agent-friendly Markdown representation of this episode resource.

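
As a rough illustration of how an agent might call the two actions listed above, here is a minimal Python sketch using only the standard library. The URLs and HTTP methods come from this page; everything else is an assumption: the helper names (`build_transcript_request`, `build_markdown_request`, `send`) are hypothetical, the POST is assumed to need no body and no authentication, and the response format is not documented here, so the sketch returns only the status code and raw text.

```python
import urllib.request

# Both URLs are copied from the Actions list on this page.
BASE = "https://stenobird.com"
PODCAST = "latent-space-ai-engineer"
EPISODE = (
    "state-of-ai-startups-memory-learning-rl-envs-"
    "dbt-fivetran-sarah-catanzaro-amplify"
)


def build_transcript_request() -> urllib.request.Request:
    """Build the request_transcript call (POST)."""
    url = (
        f"{BASE}/v1/public/podcasts/{PODCAST}"
        f"/episodes/{EPISODE}/transcription-requests"
    )
    # Assumption: an empty body and no auth header are sufficient.
    return urllib.request.Request(url, data=b"", method="POST")


def build_markdown_request() -> urllib.request.Request:
    """Build the read_markdown call (GET)."""
    url = f"{BASE}/podcast/{PODCAST}/{EPISODE}.md"
    return urllib.request.Request(url, method="GET")


def send(req: urllib.request.Request, timeout: float = 10.0) -> tuple[int, str]:
    """Send a request and return (status code, body text).

    urlopen raises urllib.error.HTTPError on 4xx/5xx responses;
    the caller decides how to handle that.
    """
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.status, resp.read().decode("utf-8", errors="replace")
```

Building the requests separately from sending them keeps the network call explicit. Because the page describes `request_transcript` as idempotent, calling `send(build_transcript_request())` more than once should be safe, and `send(build_markdown_request())` can then be used to re-read the episode resource.
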
A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.