Episode

977: Attention, World Models and the Future of AI, with Prof. Kyunghyun Cho

Podcast
Super Data Science: ML & AI Podcast with Jon Krohn
Published
Mar 24, 2026
Duration seconds
4694
Processing state
failed
Canonical source
https://www.podtrac.com/pts/redirect.mp3/chrt.fm/track/E581B9/arttrk.com/p/VI4CS/pscrb.fm/rss/p/traffic.megaphone.fm/SUPERDATASCIENCEPTYLTD7215350713.mp3?updated=1774337722
Audio
https://www.podtrac.com/pts/redirect.mp3/chrt.fm/track/E581B9/arttrk.com/p/VI4CS/pscrb.fm/rss/p/traffic.megaphone.fm/SUPERDATASCIENCEPTYLTD7215350713.mp3?updated=1774337722
JSON
/v1/public/podcasts/super-data-science/episodes/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho
Markdown
/podcast/super-data-science/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/super-data-science/episodes/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/super-data-science/977-attention-world-models-and-the-future-of-ai-with-prof-kyunghyun-cho.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

What’s going to be the next big step function that blasts us forward in AI capabilities? To find out, Jon Krohn sits down with Professor Kyunghyun Cho, whose 200,000 citations and co-authorship of the first paper on attention place him among the most influential AI researchers in the world. In this episode, Kyunghyun explains why today’s models have already captured most correlations in passive data, making the real challenge about actively choosing which data to collect. He also weighs in on the open debate around world models, whether AI needs high-fidelity, step-by-step imagination or whether a high-level latent representation that lets it skip ahead is sufficient and shares the surprising discovery that 80% of his 200 computer science students had never installed a coding agent. Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/977⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information. In this episode you will learn: (06:43) The story behind the attention mechanism (28:43) Sample efficiency and active data collection (39:04) World models and latent planning (49:52) Teaching undergrads with coding agents (58:21) Reranking, multi-stage ranking, and the foundations of RAG