Episode
From Data Models to Mind Models: Designing AI Memory at Scale
- Podcast
- Data Engineering Podcast
- Published
- Feb 22, 2026
- Duration seconds
- 3467
- Processing state
processed
Actions
POST https://stenobird.com/v1/public/podcasts/data-engineering-podcast/episodes/from-data-models-to-mind-models-designing-ai-memory-at-scale/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/data-engineering-podcast/from-data-models-to-mind-models-designing-ai-memory-at-scale.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
Designing AI memory requires moving beyond simple vector stores to a multi-layered architecture of graph and vector layers. This discussion explores how to implement agentic memory that distinguishes between session and long-term state while maintaining multi-tenant isolation.
Topics
- Agentic Memory
- Vector Databases
- Graph Databases
- Multi-tenancy
- Cognitive Science
- AI Infrastructure
- Knowledge Engineering
- LLM Orchestration
Highlights
- Main idea: Agentic memory must differentiate between short-term session state and long-term persistent knowledge
- Practical takeaway: Use a hybrid graph and vector approach to enable complex relationship retrieval and temporal searching
- Failure mode: Avoid naive summarization or uncontrolled fine-tuning as substitutes for structured memory systems
- Security insight: Implement physical isolation and multi-tenancy to prevent agents from accessing unauthorized data silos
- Future trend: The emergence of 'pseudo-languages' or structured SQL-like commands for more efficient multi-agent communication
Chapters
1:00Introduction to Knowledge Engineering: Vasilije Markovic discusses his transition from data engineering to cognitive science and the inspiration for building memory engines.5:30Defining Agentic State: The necessity of providing agents with a state representation to allow for continuity and data exchange across sessions.10:00Long-term vs. Session Memory: Exploring use cases for persistent memory, such as maintaining user profiles and breaking down data silos.18:30Optimizing Embedding Retrieval: Technical considerations for optimizing embeddings and managing large-scale retrieval for agentic workloads.23:10Temporal Relevance and Decay: How to model the temporal aspect of memory and handle information that becomes outdated over time.27:20Advanced Search Architectures: Implementing multi-modal search across graph and vector stores to find temporal and relational data.31:30Multi-tenancy and Security: Architecting isolated memory stores to ensure agents cannot cross-contaminate or access sensitive company data.