{"podcast":{"title":"80,000 Hours Podcast","slug":"80-000-hours-podcast-747608","podcast_index_feed_id":747608,"rss_url":"https://feeds.feedburner.com/80000HoursPodcast","website_url":"https://80000hours.org/podcast/","image_url":"https://img.transistorcdn.com/8dMmMNaSGF1OGBLc-bkepIvasPxJeaJRFTWpl_UqjYU/rs:fill:0:0:1/w:1400/h:1400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzQxNDAyLzE2ODM1/NDQ1NDAtYXJ0d29y/ay5qcGc.jpg","author":"The 80,000 Hours team","episode_count":339,"summary":"The most important conversations about artificial intelligence you won’t hear anywhere else. Subscribe by searching for '80000 Hours' wherever you get podcasts. Hosted by Rob Wiblin, Luisa Rodriguez, and Zershaaneh Qureshi.","last_synced_at":"2026-06-12T06:18:33.680116+00:00","page_url":"https://stenobird.com/podcast/80-000-hours-podcast-747608"},"episode":{"title":"Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)","slug":"can-ais-already-start-rogue-deployments-inside-ai-companies-landmark-new-metr-report","published_at":"2026-05-20T15:23:55+00:00","page_url":"https://stenobird.com/podcast/80-000-hours-podcast-747608/can-ais-already-start-rogue-deployments-inside-ai-companies-landmark-new-metr-report","show_page_url":"https://stenobird.com/podcast/80-000-hours-podcast-747608","url":"https://80000hours.org/podcast/episodes/metr-risk-report-red-team/?utm_campaign=podcast__metr-risk-report&utm_source=80000+Hours+Podcast&utm_medium=podcast","audio_url":"https://media.transistor.fm/262b7d74/f731b819.mp3","summary":"A red-teamer was embedded inside Anthropic for three weeks, told to imagine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI deployment’ without getting caught. It’s one part of a landmark report released yesterday by METR — the outfit behind the task-completion time horizon graph which has become the single most watched measure of AI progress. This major new research push is being conducted with close collaboration from OpenAI, Google DeepMind, Meta, and Anthropic, and led by METR researchers Hjalmar Wijk and Ajeya Cotra. It represents the first systematic study of what newly trained AI models could get away with inside the companies that built them, before anyone outside the company even knows they exist. The conclusion: AI models now have the means, the motive, and the opportunity to start “minimal rogue deployments” in pursuit of their own independent goals, like acquiring more compute, at all four companies studied. David Rein, the red-teamer placed inside Anthropic, identified a number of weaknesses models could exploit there: expansive permissions, cloud jobs outside of monitoring, and monitors that are trivial to jailbreak. But he also found that frontier models were comically bad at key parts of the process, which means they can’t cause meaningful damage for now. In this video, Rob Wiblin reconciles the conflicting picture and looks forward to METR’s second round of stress tests. They’ll begin in just a few months, a necessary move with AI advancing so quickly. This episode was recorded on May 15, 2026. Learn more, video, and full transcript: https://80k.info/metr-report Chapters: What could an unreleased AI get away with? – the new METR report (00:00:00) Motive: Why grab more compute? (00:01:54) Opportunity: YOLO mode and jailbreaks (0…","meta_description":"A red-teamer was embedded inside Anthropic for three weeks, told to imagine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI deplo…","key_points":[],"chapters":[],"topics":[],"duration_seconds":1202,"processing_state":"not_requested","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/80-000-hours-podcast-747608/episodes/can-ais-already-start-rogue-deployments-inside-ai-companies-landmark-new-metr-report/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/80-000-hours-podcast-747608/can-ais-already-start-rogue-deployments-inside-ai-companies-landmark-new-metr-report.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}