Episode

977: Attention, World Models and the Future of AI, with Prof. Kyunghyun Cho

Podcast: Super Data Science: ML & AI Podcast with Jon Krohn
Published: Mar 24, 2026
Duration seconds: 4694
Processing state: not_requested
Canonical source: https://www.podtrac.com/pts/redirect.mp3/chrt.fm/track/E581B9/arttrk.com/p/VI4CS/pscrb.fm/rss/p/traffic.megaphone.fm/SUPERDATASCIENCEPTYLTD7215350713.mp3?updated=1774337722
Audio: https://www.podtrac.com/pts/redirect.mp3/chrt.fm/track/E581B9/arttrk.com/p/VI4CS/pscrb.fm/rss/p/traffic.megaphone.fm/SUPERDATASCIENCEPTYLTD7215350713.mp3?updated=1774337722
JSON: /v1/public/podcasts/super-data-science/episodes/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho
Markdown: /podcast/super-data-science/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho.md

Actions

POST https://stenobird.com/v1/public/podcasts/super-data-science/episodes/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho/transcription-requests
Idempotently request low-priority transcript generation for this episode.
GET https://stenobird.com/podcast/super-data-science/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho.md
Read the agent-friendly Markdown representation of this episode resource.

Summary

What’s going to be the next big step function that blasts us forward in AI capabilities? To find out, Jon Krohn sits down with Professor Kyunghyun Cho, whose 200,000 citations and co-authorship of the first paper on attention place him among the most influential AI researchers in the world. In this episode, Kyunghyun explains why today’s models have already captured most correlations in passive data, making the real challenge about actively choosing which data to collect. He also weighs in on the open debate around world models, whether AI needs high-fidelity, step-by-step imagination or whether a high-level latent representation that lets it skip ahead is sufficient and shares the surprising discovery that 80% of his 200 computer science students had never installed a coding agent. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/977⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (06:43) The story behind the attention mechanism (28:43) Sample efficiency and active data collection (39:04) World models and latent planning (49:52) Teaching undergrads with coding agents (58:21) Reranking, multi-stage ranking, and the foundations of RAG