{"podcast":{"title":"AI News Today | Julian Goldie Podcast","slug":"ai-news-today-julian-goldie-podcast-7573784","podcast_index_feed_id":7573784,"rss_url":"https://anchor.fm/s/10b0edd94/podcast/rss","website_url":"https://podcasters.spotify.com/pod/show/julian-goldie9","image_url":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/44704909/44704909-1761781825225-46220d4938e3.jpg","author":"Julian Goldie","episode_count":402,"summary":"Latest Podcast","last_synced_at":"2026-06-11T18:18:35.690849+00:00","page_url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784"},"episode":{"title":"Claude Opus 4.8 Just Changed AI Agents FOREVER!","slug":"claude-opus-4-8-just-changed-ai-agents-forever","published_at":"2026-05-29T20:56:00+00:00","page_url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784/claude-opus-4-8-just-changed-ai-agents-forever","show_page_url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784","url":"https://podcasters.spotify.com/pod/show/julian-goldie9/episodes/Claude-Opus-4-8-Just-Changed-AI-Agents-FOREVER-e3k2h67","audio_url":"https://anchor.fm/s/10b0edd94/podcast/play/120717959/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2026-4-29%2F425124757-44100-2-ac63724ba5803.mp3","summary":"Claude Opus 4.8: Dynamic Workflows, Hundreds of Agents, and the Benchmarks That MatterThe script explains Claude Opus 4.8’s release, highlighting “dynamic workflows” that let Claude spin up hundreds of coordinated agents in one session to plan, build, review, and self-check work, with agents able to run for days. It describes enabling this via Claude Code’s Ultra Code mode (and by asking Claude to create a dynamic workflow), and promotes an “agent operating system” inside the AI Profit Bot/AI Profit Boardroom with a zip file, tutorial, 30-day roadmap, prompts, weekly coaching calls, and a community. The video reviews benchmarks: SWE-bench Verified 88.6% and SWE-bench Pro 69.2% (ahead of GPT-5.5 and Gemini 3.1 Pro), Terminal Bench 74.6% (behind GPT-5.5), Frontier SWE rank #1, office-work GTP-Val where 4.8 beats GPT-5.5 in most matchups, improved Zapier workflow score, and a major long-context memory jump. It also notes reduced overclaiming, lower “sneaky” behavior, effort controls, faster/cheaper fast mode, and that Anthropic’s teased Mythos model may arrive soon. 00:00 Opus 4.8 Biggest Upgrade 00:52 Dynamic Workflows Explained 01:25 Ultra Code One Click Setup 01:51 Parallel Agent Speed Gains 02:33 Bun Rebuild Case Study 03:10 Agents Running For Days 03:33 Agent OS And Offer 04:26 Coding Benchmarks Breakdown 06:08 Office Legal Workflow Tests 07:17 Long Context Memory Leap 08:22 Honesty And Safety Improvements 09:15 Mythos Preview And Timeline 10:02 Effort Controls Pricing Speed 10:34 Wrap Up And Next Steps","meta_description":"Claude Opus 4.8: Dynamic Workflows, Hundreds of Agents, and the Benchmarks That MatterThe script explains Claude Opus 4.8’s release, highlighting “dynamic…","key_points":[],"chapters":[],"topics":[],"duration_seconds":733,"processing_state":"not_requested","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/ai-news-today-julian-goldie-podcast-7573784/episodes/claude-opus-4-8-just-changed-ai-agents-forever/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784/claude-opus-4-8-just-changed-ai-agents-forever.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}