{"podcast":{"title":"Last Week in AI","slug":"last-week-in-ai","podcast_index_feed_id":396447,"rss_url":"https://rss.art19.com/last-week-in-ai","website_url":"https://art19.com/shows/last-week-in-ai","image_url":"https://content.production.cdn.art19.com/images/d8/60/88/b2/d86088b2-d713-4824-8483-a985aa7d7f32/e4063a3a93d1635f5b88961b422beb3e4fb4feab7fa085837e15faa5db2703d1830d964620373fcc524cfeee13ef3402821ce39d8fa98fd77271c57a80e7f24d.jpeg","author":"Skynet Today","episode_count":282,"summary":"Weekly summaries of the AI news that matters!","last_synced_at":null,"page_url":"https://stenobird.com/podcast/last-week-in-ai"},"episode":{"title":"#243 - GPT 5.5, DeepSeek V4, AI safety sabotage","slug":"243-gpt-5-5-deepseek-v4-ai-safety-sabotage","published_at":"2026-05-03T07:30:00+00:00","page_url":"https://stenobird.com/podcast/last-week-in-ai/243-gpt-5-5-deepseek-v4-ai-safety-sabotage","show_page_url":"https://stenobird.com/podcast/last-week-in-ai","url":"https://rss.art19.com/episodes/a8a95861-9763-4fe9-9282-61509b9f258c.mp3?rss_browser=BAhJIg90cmFuc2NyaWJyBjoGRVQ%3D--952c5701c84ad333c69d5faa668f8177091704f0","audio_url":"https://rss.art19.com/episodes/a8a95861-9763-4fe9-9282-61509b9f258c.mp3?rss_browser=BAhJIg90cmFuc2NyaWJyBjoGRVQ%3D--952c5701c84ad333c69d5faa668f8177091704f0","summary":"Our 243rd episode with a summary and discussion of last week's big AI news! Recorded on 04/29/2026 Hosted by Andrey Kurenkov and Jeremie Harris Feel free to email us your questions and feedback at andreyvkurenkov@gmail.com and/or&nbsp; hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/ In this episode: OpenAI released GPT-5.5 with strong coding-oriented improvements, a system card discussing chain-of-thought monitorability and misalignment testing, higher pricing than GPT-5.4, and notable quirks like a system-prompt warning about “goblins.” xAI launched Grok Voice Think Fast 1.0, claiming large benchmark leads for real-time voice agents and reporting major Starlink customer-support automation and sales conversion impact. DeepSeek open-sourced DeepSeek V4 (Pro and Flash) featuring MoE scaling and 1M-token context via hybrid/compressed attention changes, while Tencent released Hunyuan 3 preview with weaker benchmark performance; a new long-horizon agent benchmark (Clawmark) shows low task success rates. Major business, legal, and policy updates include Google’s planned up-to-$40B investment and 5GW compute commitment to Anthropic, Meta’s AWS Gravitron deal and China blocking Meta’s Manus acquisition, a revamped OpenAI–Microsoft agreement, ongoing Musk–OpenAI trial developments, and new safety/security research on sabotage, document degradation under delegation, and bit-flip attacks. Timestamps: (00:00:10) Intro / Banter (00:02:00) News Preview (00:02:26) Response to listener comments (00:02:55) Sponsors Tools &amp; Apps (00:05:55) OpenAI Unveils Its New, More Powerful GPT-5.5 Model - The New York Times (00:23:33) xAI Launches grok-voice-think-fast-1.0: Topping τ-voice Bench at 67.3%, Outperforming Gemini, GPT Realtime, an…","meta_description":"Our 243rd episode with a summary and discussion of last week's big AI news! Recorded on 04/29/2026 Hosted by Andrey Kurenkov and Jeremie Harris Feel free…","key_points":[],"chapters":[],"topics":[],"duration_seconds":6742,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/last-week-in-ai/episodes/243-gpt-5-5-deepseek-v4-ai-safety-sabotage/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/last-week-in-ai/243-gpt-5-5-deepseek-v4-ai-safety-sabotage.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}