Episode

#243 – 'Godfather of AI' Yoshua Bengio: "I now see a path" to safe superintelligent AI

Podcast: 80,000 Hours Podcast
Published: May 7, 2026
Duration seconds: 9327
Processing state: not_requested
Canonical source: https://80000hours.org/podcast/episodes/yoshua-bengio-scientist-ai/?utm_campaign=podcast__yoshua-bengio&utm_source=80000+Hours+Podcast&utm_medium=podcast
Audio: https://media.transistor.fm/07925816/8b652e84.mp3
JSON: /v1/public/podcasts/80-000-hours-podcast-747608/episodes/243-godfather-of-ai-yoshua-bengio-i-now-see-a-path-to-safe-superintelligent-ai
Markdown: /podcast/80-000-hours-podcast-747608/243-godfather-of-ai-yoshua-bengio-i-now-see-a-path-to-safe-superintelligent-ai.md

Actions

POST https://stenobird.com/v1/public/podcasts/80-000-hours-podcast-747608/episodes/243-godfather-of-ai-yoshua-bengio-i-now-see-a-path-to-safe-superintelligent-ai/transcription-requests
Idempotently request low-priority transcript generation for this episode.
GET https://stenobird.com/podcast/80-000-hours-podcast-747608/243-godfather-of-ai-yoshua-bengio-i-now-see-a-path-to-safe-superintelligent-ai.md
Read the agent-friendly Markdown representation of this episode resource.

Summary

The co-inventor of modern AI and the most cited living scientist believes he's figured out how to ensure AI is honest, incapable of deception, and never goes rogue. Yoshua Bengio – Turing Award Winner and founder of LawZero – is disturbed by the many unintended drives and goals present in today's AIs, their willingness to lie, and ability to tell when they're being tested. AI companies are trying to stamp out these behaviours in a 'cat-and-mouse game' that Yoshua fears they're losing. --- Our new book is "a ridiculously in-depth guide to finding a fulfilling career that does good" and is out now! Order from your local bookstore, or online at https://80k.info/career-guide --- But Yoshua is optimistic: he believes the companies can win this battle decisively with a single rearrangement to how AI models are trained, and has been developing mathematical proofs to back up the claim. The core idea is that instead of training AI to predict what a human would say, or to produce responses we'd rate highly, we should train it to model what's actually true. Yoshua argues this new architecture, which he calls 'Scientist AI,' is a small enough change that we could keep almost all the techniques and data we use to train frontier AIs like Claude and ChatGPT. And that the new architecture need not cost more, could be built iteratively, and might be more capable as well as more honest. Links to learn more, video, and full transcript: https://80k.info/bengio Until recently, the biggest practical objection to Scientist AI was simple: the world wants agents, and Scientist AI isn’t one. But in new research, Yoshua has extended the design and believes the same honest predictor can be turned into a capable agent without losing its "safety guarantees." With the Scientist AI proposal on the ta…