# #236 - GPT 5.4, Gemini 3.1 Flash Lite, Supply Chain Risk Page: https://stenobird.com/podcast/last-week-in-ai/236-gpt-5-4-gemini-3-1-flash-lite-supply-chain-risk Text version: https://stenobird.com/podcast/last-week-in-ai/236-gpt-5-4-gemini-3-1-flash-lite-supply-chain-risk.md Podcast: [Last Week in AI](https://stenobird.com/podcast/last-week-in-ai) Published: 2026-03-12T16:00:00+00:00 Episode link: https://rss.art19.com/episodes/5e0a3187-ff84-45cd-a6f2-46a144b44e65.mp3?rss_browser=BAhJIg90cmFuc2NyaWJyBjoGRVQ%3D--952c5701c84ad333c69d5faa668f8177091704f0 Audio file: https://rss.art19.com/episodes/5e0a3187-ff84-45cd-a6f2-46a144b44e65.mp3?rss_browser=BAhJIg90cmFuc2NyaWJyBjoGRVQ%3D--952c5701c84ad333c69d5faa668f8177091704f0 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/last-week-in-ai/episodes/236-gpt-5-4-gemini-3-1-flash-lite-supply-chain-risk Duration seconds: 5314 ## Resource OpenAI's release of GPT-5.4 Pro introduces massive context windows and native computer use, while Google's Gemini 3.1 Flash Lite focuses on extreme cost efficiency. The episode also explores the growing tension between frontier AI labs and defense contracting, specifically regarding OpenAI's DoD involvement. ## Highlights - Main idea: OpenAI's GPT-5.4 Pro achieves 83% on GPT-VAL with a 1M-token context window and improved tool use - Practical takeaway: Google's Gemini 3.1 Flash Lite offers a high-throughput, low-cost alternative for agentic workflows - Failure mode: Real-world AI agents pose significant risks, illustrated by an instance of an AI-driven mass email deletion - Main idea: Luma's new unified multimodal models enable end-to-end creative automation, significantly reducing production costs - Tension: OpenAI's pursuit of DoD contracts is driving employee churn and consumer migration toward Anthropic's Claude ## Topics OpenAI, GPT-5.4 Pro, Google Gemini, AI Agents, LLM Evaluation, Defense Technology, Multimodal AI, Luma AI ## Chapters - 1:00 — The Enterprise AI Context Layer: A look at why business content and secure context are more critical for AI transformation than model size alone. - 8:15 — GPT-5.4 Pro and Performance Benchmarks: Analyzing the technical specs of OpenAI's new model, including its 83% GPT-VAL score and improved reasoning. - 15:15 — Reducing Model 'Preachiness': Discussion on GPT-5.3 Instant's shift toward a more direct, less cautious conversational tone. - 22:05 — Agentic Integration with Google Workspace: How Google's new CLI makes Gmail, Drive, and Docs 'agent-ready' for automated workflows. - 28:40 — Luma's Unified Multimodal Intelligence: Evaluating Luma's new agents that automate complex creative tasks across text, image, and video. - 35:50 — The Geopolitics of AI and Defense Contracts: The fallout from OpenAI's DoD involvement, including employee departures and the rise of Anthropic in app rankings. - 49:50 — OpenAI's Massive Valuation Jump: Analyzing the implications of OpenAI's $730B valuation and the influx of capital from major tech players. ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/last-week-in-ai/episodes/236-gpt-5-4-gemini-3-1-flash-lite-supply-chain-risk/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/last-week-in-ai/236-gpt-5-4-gemini-3-1-flash-lite-supply-chain-risk.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.