Episode
Making deep learning perform real algorithms with Category Theory (Andrew Dudzik, Petar Velichkovich, Taco Cohen, Bruno Gavranović, Paul Lessard)
- Published
- Dec 22, 2025
- Duration seconds
- 2637
- Processing state
processed
Actions
POST https://stenobird.com/v1/public/podcasts/machine-learning-street-talk/episodes/making-deep-learning-perform-real-algorithms-with-category-theory-andrew-dudzik-petar-velichkovich-taco-cohen-bruno-gavranovi-paul-lessard/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/machine-learning-street-talk/making-deep-learning-perform-real-algorithms-with-category-theory-andrew-dudzik-petar-velichkovich-taco-cohen-bruno-gavranovi-paul-lessard.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
Modern LLMs struggle with basic algorithmic tasks like addition because they rely on pattern recognition rather than internal computational logic. This discussion explores using Category Theory to move deep learning from empirical 'alchemy' to a principled engineering discipline.
Topics
- Category Theory
- Deep Learning
- Large Language Models
- Geometric Deep Learning
- Algorithmic Reasoning
- Neural Network Architecture
- Mathematical Logic
- Computational Complexity
Highlights
- Main idea: LLMs lack the internal machinery for operations like carrying digits, making them prone to failure on simple arithmetic
- Failure mode: Relying on tool-use or chain-of-thought acts as a patch rather than fixing the underlying architectural inability to process algorithms
- Practical takeaway: Moving from 'Analytic' to 'Synthetic' mathematics can help models internalize rules rather than just observing data
- Main idea: Category Theory offers a 'Periodic Table' for neural networks, allowing for the composition of complex operations from simpler ones
- Technical vision: The goal is to build neural networks that can execute computation in a way that is mathematically reasonble and robust
Chapters
1:00The Failure of LLM Addition: Why language models fail at basic arithmetic despite appearing competent through pattern recognition.4:35Broadening Geometric Deep Learning: The need to expand the lens of geometric deep learning to include more complex algorithmic structures.7:35Invariance and Preconditions: Discussing the challenges of maintaining computational invariants when pushing data through functions.10:40Compositionality and Color Violations: Using the analogy of 'colors' to explain how types must match for successful algebraic composition.14:00Moving Beyond Deep Learning Alchemy: The transition from ad hoc architectural design choices to a principled, theory-driven foundation.17:35Aligning Models to Algorithms: The difficulty of aligning neural architectures with classical algorithmic computation and permutation invariance.21:00The Complexity of Multiplication: Why scaling inputs makes the translation between embeddings and algorithms increasingly prone to failure.24:20Synthetic Mathematics: A shift toward a mathematical framework where principles are abstracted rather than just observed.