Episode

Prof. Jakob Foerster - ImageNet Moment for Reinforcement Learning?

Podcast: Machine Learning Street Talk (MLST)
Published: Feb 18, 2025
Duration seconds: 3211
Processing state: processed
Canonical source: https://podcasters.spotify.com/pod/show/machinelearningstreettalk/episodes/Prof--Jakob-Foerster---ImageNet-Moment-for-Reinforcement-Learning-e2v2cl2
Audio: https://anchor.fm/s/1e4a0eac/podcast/play/98693218/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2025-1-18%2F52dc0817-4adf-4877-0b7b-e190a4542c3a.mp3
JSON: /v1/public/podcasts/machine-learning-street-talk/episodes/prof-jakob-foerster-imagenet-moment-for-reinforcement-learning
Markdown: /podcast/machine-learning-street-talk/prof-jakob-foerster-imagenet-moment-for-reinforcement-learning.md

Actions

POST https://stenobird.com/v1/public/podcasts/machine-learning-street-talk/episodes/prof-jakob-foerster-imagenet-moment-for-reinforcement-learning/transcription-requests
Idempotently request low-priority transcript generation for this episode.
GET https://stenobird.com/podcast/machine-learning-street-talk/prof-jakob-foerster-imagenet-moment-for-reinforcement-learning.md
Read the agent-friendly Markdown representation of this episode resource.

Summary

Prof. Jakob Foerster, a leading AI researcher at Oxford University and Meta, and Chris Lu, a researcher at OpenAI -- they explain how AI is moving beyond just mimicking human behaviour to creating truly intelligent agents that can learn and solve problems on their own. Foerster champions open-source AI for responsible, decentralised development. He addresses AI scaling, goal misalignment (Goodhart's Law), and the need for holistic alignment, offering a quick look at the future of AI and how to guide it. SPONSOR MESSAGES: *** CentML offers competitive pricing for GenAI model deployment, with flexible options to suit a wide range of models, from small to large-scale deployments. Check out their super fast DeepSeek R1 hosting! https://centml.ai/pricing/ Tufa AI Labs is a brand new research lab in Zurich started by Benjamin Crouzier focussed on o-series style reasoning and AGI. They are hiring a Chief Engineer and ML engineers. Events in Zurich. Goto https://tufalabs.ai/ *** TRANSCRIPT/REFS: https://www.dropbox.com/scl/fi/yqjszhntfr00bhjh6t565/JAKOB.pdf?rlkey=scvny4bnwj8th42fjv8zsfu2y&dl=0 Prof. Jakob Foerster https://x.com/j_foerst https://www.jakobfoerster.com/ University of Oxford Profile: https://eng.ox.ac.uk/people/jakob-foerster/ Chris Lu: https://chrislu.page/ TOC 1. GPU Acceleration and Training Infrastructure [00:00:00] 1.1 ARC Challenge Criticism and FLAIR Lab Overview [00:01:25] 1.2 GPU Acceleration and Hardware Lottery in RL [00:05:50] 1.3 Data Wall Challenges and Simulation-Based Solutions [00:08:40] 1.4 JAX Implementation and Technical Acceleration 2. Learning Frameworks and Policy Optimization [00:14:18] 2.1 Evolution of RL Algorithms and Mirror Learning Framework [00:15:25] 2.2 Meta-Learning and Policy Optimization Algorithms [00:21:47] 2.3 Language Mod…