{"podcast":{"title":"Machine Learning Street Talk (MLST)","slug":"machine-learning-street-talk","podcast_index_feed_id":781643,"rss_url":"https://anchor.fm/s/1e4a0eac/podcast/rss","website_url":"https://podcasters.spotify.com/pod/show/machinelearningstreettalk","image_url":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/4981699/4981699-1757416025703-f026fa81b6d04.jpg","author":"Machine Learning Street Talk (MLST)","episode_count":254,"summary":"Welcome! We engage in fascinating discussions with pre-eminent figures in the AI field. Our flagship show covers current affairs in AI, cognitive science, neuroscience and philosophy of mind with in-depth analysis. Our approach is unrivalled in terms of scope and rigour – we believe in intellectual diversity in AI, and we touch on all of the main ideas in the field with the hype surgically removed. MLST is run by Tim Scarfe, Ph.D (https://www.linkedin.com/in/ecsquizor/) and features regular appearances from MIT Doctor of Philosophy Keith Duggar (https://www.linkedin.com/in/dr-keith-duggar/).","last_synced_at":"2026-06-29T06:18:19.444379+00:00","page_url":"https://stenobird.com/podcast/machine-learning-street-talk"},"episode":{"title":"New top score on ARC-AGI-2-pub (29.4%) - Jeremy Berman","slug":"new-top-score-on-arc-agi-2-pub-29-4-jeremy-berman","published_at":"2025-09-27T16:21:01+00:00","page_url":"https://stenobird.com/podcast/machine-learning-street-talk/new-top-score-on-arc-agi-2-pub-29-4-jeremy-berman","show_page_url":"https://stenobird.com/podcast/machine-learning-street-talk","url":"https://podcasters.spotify.com/pod/show/machinelearningstreettalk/episodes/New-top-score-on-ARC-AGI-2-pub-29-4---Jeremy-Berman-e38pj96","audio_url":"https://traffic.megaphone.fm/APO8526044538.mp3","summary":"Jeremy Berman explains how shifting from Python code generation to natural language instructions allowed his system to achieve a top score on the ARC-AGI-2-pub leaderboard. The discussion explores the transition from pattern memorization to true algorithmic reasoning and the potential for models to synthesize new knowledge.","meta_description":"Explore the breakthrough approach to ARC-AGI-2-pub using natural language evolution and the debate between neural networks and symbolic reasoning.","key_points":["Main idea: Natural language provides a more expressive programming medium than Python for solving complex visual reasoning tasks","Practical takeaway: In the ARC-AGI-2-pub challenge, a stronger 'checker' model is more critical for success than a stronger 'instruction creator'","Failure mode: Relying solely on pre-training can actually hinder reasoning by encouraging pattern memorization over logical deduction","Technical insight: The trade-off in ARC-AGI-2-pub involves balancing the breadth of the search space with the depth of the instruction complexity","Future vision: True AGI requires a meta-skill for reasoning that allows models to learn and synthesize new skills without losing existing knowledge"],"chapters":[{"start_ms":60000,"title":"The Goal of Knowledge Synthesis","summary":"Discussing the need for AI to move beyond data compression toward systems that can integrate and learn new information dynamically."},{"start_ms":370000,"title":"Evolutionary Program Synthesis","summary":"A look at the transition from program synthesis to reinforcement learning with verifiable feedback."},{"start_ms":700000,"title":"The Shift to Natural Language","summary":"Why moving from Python to English instructions improved accuracy by increasing the degrees of freedom in the solution space."},{"start_ms":1025000,"title":"Neural Networks vs. Turing Completeness","summary":"Debating whether LLMs possess true intelligence or are simply searching through the space of Turing programs."},{"start_ms":1325000,"title":"The Challenge of Continual Learning","summary":"Exploring the possibility of freezing expert layers to allow for new learning without catastrophic forgetting."},{"start_ms":1655000,"title":"The Power of Expressive Programs","summary":"Analyzing how combining neural networks with a Python terminal can bridge the gap between intuition and execution."},{"start_ms":3250000,"title":"Pre-training as a Barrier to Reasoning","summary":"A provocative take on how massive pre-training might act as a 'consultant' that knows names but lacks deductive capability."}],"topics":["ARC-AGI","Program Synthesis","Natural Language Processing","Reinforcement Learning","Artificial General Intelligence","Symbolic Reasoning","Evolutionary Algorithms","Machine Learning"],"duration_seconds":4107,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/machine-learning-street-talk/episodes/new-top-score-on-arc-agi-2-pub-29-4-jeremy-berman/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/machine-learning-street-talk/new-top-score-on-arc-agi-2-pub-29-4-jeremy-berman.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}