{"podcast":{"title":"Last Week in AI","slug":"last-week-in-ai","podcast_index_feed_id":396447,"rss_url":"https://rss.art19.com/last-week-in-ai","website_url":"https://art19.com/shows/last-week-in-ai","image_url":"https://content.production.cdn.art19.com/images/d8/60/88/b2/d86088b2-d713-4824-8483-a985aa7d7f32/e4063a3a93d1635f5b88961b422beb3e4fb4feab7fa085837e15faa5db2703d1830d964620373fcc524cfeee13ef3402821ce39d8fa98fd77271c57a80e7f24d.jpeg","author":"Skynet Today","episode_count":282,"summary":"Weekly summaries of the AI news that matters!","last_synced_at":null,"page_url":"https://stenobird.com/podcast/last-week-in-ai"},"episode":{"title":"#228 - GPT 5.2, Scaling Agents, Weird Generalization","slug":"228-gpt-5-2-scaling-agents-weird-generalization","published_at":"2025-12-17T08:00:00+00:00","page_url":"https://stenobird.com/podcast/last-week-in-ai/228-gpt-5-2-scaling-agents-weird-generalization","show_page_url":"https://stenobird.com/podcast/last-week-in-ai","url":"https://rss.art19.com/episodes/ff43c594-5876-4808-9d7e-4ff32cca7d5b.mp3?rss_browser=BAhJIg90cmFuc2NyaWJyBjoGRVQ%3D--952c5701c84ad333c69d5faa668f8177091704f0","audio_url":"https://rss.art19.com/episodes/ff43c594-5876-4808-9d7e-4ff32cca7d5b.mp3?rss_browser=BAhJIg90cmFuc2NyaWJyBjoGRVQ%3D--952c5701c84ad333c69d5faa668f8177091704f0","summary":"OpenAI's GPT-5.2 release marks a significant leap in multi-modal performance, though it introduces new cost and knowledge cutoff challenges. The episode also explores the massive $1 billion Disney-OpenAI partnership and the complexities of scaling multi-agent systems.","meta_description":"Explore the impact of GPT-5.2, Disney's $1B OpenAI investment, and the latest research on scaling multi-agent AI systems and robotics.","key_points":["Main idea: GPT-5.2 demonstrates superior reasoning on benchmarks like Suibench Pro compared to Claude 4.5 Opus","Business shift: Disney's $1 billion investment in OpenAI aims to integrate Marvel, Pixar, and Star Wars characters into Sora","Practical takeaway: Scaling multi-agent systems requires solving complex tool coordination and task performance challenges","Failure mode: Relying solely on increased compute (software-only singularity) may not be enough to reach superintelligence without algorithmic breakthroughs","Geopolitical tension: New U.S. chip export rules and investigations into smuggling networks highlight AI hardware as critical national security infrastructure"],"chapters":[{"start_ms":470000,"title":"GPT-5.2 Performance vs Claude 4.5","summary":"A comparison of reasoning capabilities, noting GPT-5.2's top-tier performance on Suibench Pro."},{"start_ms":875000,"title":"Product Updates: Adobe & Google","summary":"Discussion on ChatGPT's new integration with Adobe apps and Google's approach to linking AI sources."},{"start_ms":1260000,"title":"Global Chip Competition","summary":"The struggle for Nvidia H200 chips in China and the implications of U.S. export controls."},{"start_ms":1650000,"title":"The Rise of Neuromorphic Computing","summary":"Unconventional AI's massive seed round and the pursuit of energy-efficient, biological-style computing."},{"start_ms":2880000,"title":"The Science of Scaling Agents","summary":"DeepMind's research into the difficulties of coordinating multiple agents in complex environments."},{"start_ms":4085000,"title":"Stability in LLM Reasoning","summary":"Exploring mathematical approaches to maintaining stability during intermediate reasoning steps."}],"topics":["OpenAI","GPT-5.2","Multi-agent systems","AI hardware","Robotics","Machine Learning","Generative Video","AI Regulation"],"duration_seconds":5202,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/last-week-in-ai/episodes/228-gpt-5-2-scaling-agents-weird-generalization/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/last-week-in-ai/228-gpt-5-2-scaling-agents-weird-generalization.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}