{"podcast":{"title":"AI News Today | Julian Goldie Podcast","slug":"ai-news-today-julian-goldie-podcast-7573784","podcast_index_feed_id":7573784,"rss_url":"https://anchor.fm/s/10b0edd94/podcast/rss","website_url":"https://podcasters.spotify.com/pod/show/julian-goldie9","image_url":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/44704909/44704909-1761781825225-46220d4938e3.jpg","author":"Julian Goldie","episode_count":402,"summary":"Latest Podcast","last_synced_at":"2026-06-11T18:18:35.690849+00:00","page_url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784"},"episode":{"title":"China’s Qwen 3.7 Max DESTROYS Claude?","slug":"china-s-qwen-3-7-max-destroys-claude","published_at":"2026-05-29T11:04:14+00:00","page_url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784/china-s-qwen-3-7-max-destroys-claude","show_page_url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784","url":"https://podcasters.spotify.com/pod/show/julian-goldie9/episodes/Chinas-Qwen-3-7-Max-DESTROYS-Claude-e3k2cc9","audio_url":"https://anchor.fm/s/10b0edd94/podcast/play/120713033/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2026-4-29%2F425119090-44100-2-58a5091e9905d.mp3","summary":"Qwen 3.7 Max: Alibaba’s 35-Hour Autonomous Agent Demo + Claude-Beating Benchmarks (With Caveats)Alibaba’s new flagship model, Qwen 3.7 Max, was unveiled around the Alibaba Cloud Summit in Hangzhou (May 20, 2026) and is positioned as a closed, proprietary frontier model aimed at enterprise, narrowing the gap with Claude Opus 4.7 while costing less per token. The script highlights strong agentic benchmarks (e.g., Terminal Bench 2.0, SWE-Bench Pro, MC Atlas, GPQA Diamond) and broad compatibility with agent frameworks and APIs (OpenAI and Anthropic specs), plus availability across multiple platforms. It also stresses caveats: the model is unusually verbose, which can raise real costs, and it has a low hallucination rate partly due to a much lower attempt rate. A headline 35-hour autonomous optimization demo (vendor-stated, not independently verified) reportedly achieved a 10× speedup on Alibaba’s Shenwu M890 chip kernel. 00:00 Qwen Shocks The Frontier 01:34 What Qwen 3.7 Max Is 02:17 Agent Framework Compatibility 02:55 Benchmark Wins Explained 04:12 Pricing And Token Trap 05:29 Hallucinations Versus Refusals 06:27 Inside The 35 Hour Demo 08:19 Hermes Agent Integration 09:50 Should You Switch Now 11:39 How To Test And Deploy 12:36 Stop Waiting Start Building 14:31 Final Takeaways And Caveats","meta_description":"Qwen 3.7 Max: Alibaba’s 35-Hour Autonomous Agent Demo + Claude-Beating Benchmarks (With Caveats)Alibaba’s new flagship model, Qwen 3.7 Max, was unveiled a…","key_points":[],"chapters":[],"topics":[],"duration_seconds":911,"processing_state":"not_requested","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/ai-news-today-julian-goldie-podcast-7573784/episodes/china-s-qwen-3-7-max-destroys-claude/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/ai-news-today-julian-goldie-podcast-7573784/china-s-qwen-3-7-max-destroys-claude.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}