Episode
#234 - Opus 4.6, GPT-5.3-codex, Seedance 2.0, GLM-5
- Podcast: Last Week in AI
- Published: Feb 16, 2026
- Duration seconds: 5433
- Processing state: processed
Actions
POST https://stenobird.com/v1/public/podcasts/last-week-in-ai/episodes/234-opus-4-6-gpt-5-3-codex-seedance-2-0-glm-5/transcription-requests
Idempotently request low-priority transcript generation for this episode.
GET https://stenobird.com/podcast/last-week-in-ai/234-opus-4-6-gpt-5-3-codex-seedance-2-0-glm-5.md
Read the agent-friendly Markdown representation of this episode resource.
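The two actions above could be prepared from a script roughly as follows. This is a minimal sketch, not a documented client: the helper names `transcription_request` and `markdown_request` are hypothetical, the URL patterns are inferred from the two URLs listed for this episode, and the POST is assumed to need no body or authentication, since the page does not describe request or response formats.

```python
from urllib.request import Request

BASE_URL = "https://stenobird.com"


def transcription_request(podcast_slug: str, episode_slug: str) -> Request:
    """Build (but do not send) the POST that requests transcript generation.

    Assumption: an empty body with no extra headers is acceptable.
    """
    url = (
        f"{BASE_URL}/v1/public/podcasts/{podcast_slug}"
        f"/episodes/{episode_slug}/transcription-requests"
    )
    return Request(url, method="POST")


def markdown_request(podcast_slug: str, episode_slug: str) -> Request:
    """Build (but do not send) the GET for the Markdown representation."""
    url = f"{BASE_URL}/podcast/{podcast_slug}/{episode_slug}.md"
    return Request(url, method="GET")
```

Passing either object to `urllib.request.urlopen` would send it. Because the POST is described as idempotent, repeating it for the same episode should be safe, though the status codes and response body it returns are not specified here.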
Summary
A deep dive into the massive wave of new model releases from Anthropic, OpenAI, and Google, alongside the rise of high-performance Chinese models. The episode explores the shift from pure scaling to agentic workflows and the emerging risks of model incoherence.
Topics
- Anthropic Opus
- OpenAI GPT-5.3
- Google Gemini 3
- Generative Video
- AI Agents
- Large Language Models
- Cerebras
- Model Incoherence
Highlights
- Main idea: Anthropic's Opus 4.6 marks a transition from simple chat to 'agent teams' with massive context windows
- Practical takeaway: OpenAI's GPT-5.3 Codex is leveraging Cerebras hardware to achieve significant speed increases for coding tasks
- Failure mode: Increasing model scale may lead to 'incoherence,' where models become more capable but more unpredictable and jittery
- Market trend: Massive venture capital inflows, such as ElevenLabs' $11B valuation, signal a rapid maturation of the AI startup lifecycle
- Technical shift: The industry is moving toward hybrid architectures, like Qwen3 Coder, to balance efficiency with agentic capabilities
Chapters
- 9:00 Anthropic's Shift to Agentic Workflows: Discussion on how Opus 4.6 is being positioned as a universal knowledge worker rather than just a developer tool.
- 16:10 OpenAI, Codex, and the Cerebras Advantage: Analyzing the performance gains of GPT-5.3 Codex and why the industry is looking for alternatives to the NVIDIA stack.
- 30:35 Google Gemini 3 and STEM Benchmarks: Evaluating Gemini 3 Deep Think's performance in science and engineering despite concerns over safety documentation.
- 37:50 The Rise of Chinese Generative Media: A look at ByteDance's Seedance 2.0 and Alibaba's Qwen Image 2.0 as serious competitors in the video and image space.
- 51:25 The Economics of AI Scaling: Examining the massive valuations of ElevenLabs and Runway and the sustainability of high-margin AI services.
- 1:12:25 The Problem of Model Incoherence: A technical discussion on how scaling compute might increase capability while simultaneously increasing task-level jitter and unpredictability.