Episode

"Automated Alignment is Harder Than You Think" by Aleksandr Bowkis, Marie_DB, Jacob Pfau, Geoffrey Irving

Podcast
LessWrong (Curated & Popular)
Published
May 17, 2026
Duration seconds
471
Processing state
not_requested
Canonical source
https://www.buzzsprout.com/2037297/episodes/19189323-automated-alignment-is-harder-than-you-think-by-aleksandr-bowkis-marie_db-jacob-pfau-geoffrey-irving.mp3
Audio
https://www.buzzsprout.com/2037297/episodes/19189323-automated-alignment-is-harder-than-you-think-by-aleksandr-bowkis-marie_db-jacob-pfau-geoffrey-irving.mp3
JSON
/v1/public/podcasts/lesswrong-curated-popular-5643401/episodes/automated-alignment-is-harder-than-you-think-by-aleksandr-bowkis-marie-db-jacob-pfau-geoffrey-irving
Markdown
/podcast/lesswrong-curated-popular-5643401/automated-alignment-is-harder-than-you-think-by-aleksandr-bowkis-marie-db-jacob-pfau-geoffrey-irving.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/lesswrong-curated-popular-5643401/episodes/automated-alignment-is-harder-than-you-think-by-aleksandr-bowkis-marie-db-jacob-pfau-geoffrey-irving/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/lesswrong-curated-popular-5643401/automated-alignment-is-harder-than-you-think-by-aleksandr-bowkis-marie-db-jacob-pfau-geoffrey-irving.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Summary This is a summary of a paper published by the alignment team at UK AISI. Read the full paper here. AI research agents may help solve ASI alignment, for example via the following plan: Build agents that can do empirical alignment work (e.g.~writing code, running experiments, designing evaluations and red teaming) and confirm they are not scheming.[1]Use these agents to build increasingly sophisticated empirical safety cases for each successive generation of agents, gradually aut...