# Autonomous Vehicle Research at Waymo Page: https://stenobird.com/podcast/practical-ai/autonomous-vehicle-research-at-waymo Text version: https://stenobird.com/podcast/practical-ai/autonomous-vehicle-research-at-waymo.md Podcast: [Practical AI](https://stenobird.com/podcast/practical-ai) Published: 2025-11-13T15:33:36+00:00 Episode link: https://share.transistor.fm/s/7b9cfe10 Audio file: https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/7b9cfe10/fe5031af.mp3 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/practical-ai/episodes/autonomous-vehicle-research-at-waymo Duration seconds: 3128 ## Resource Waymo's VP of Research, Drago Anguelov, explains the technical duality of developing an onboard autonomous driver while simultaneously building massive-scale simulations. The discussion explores how foundation models and vision-language-action models are being integrated into the autonomy stack to improve safety and real-world performance. ## Highlights - Main idea: Autonomous driving requires a dual-track approach: optimizing the onboard driver for real-time performance and building massive-scale simulators for validation - Practical takeaway: Large vision-language models (VLMs) are being used offboard to curate data and teach the onboard models through improved world knowledge - Failure mode: Relying solely on generative models for driving is risky; Waymo implements a 'safety harness' to validate and constrain model predictions - Technical challenge: Scaling simulation realism without exponential increases in compute costs is a critical bottleneck for verifying edge cases - Future direction: The integration of vision-language-action (VLA) models and the development of more generalizable, scalable world models ## Topics Autonomous Vehicles, Foundation Models, Computer Vision, Large Language Models, Robotics, Simulation, Machine Learning, Waymo ## Chapters - 4:45 — Safety Milestones and Hardware Evolution: A look at Waymo's safety data showing a significant reduction in pedestrian incidents and the upcoming sixth-generation vehicle hardware. - 8:55 — Expanding Operational Domains: Discussing the expansion of autonomous services into new geographic areas and complex environments like highways. - 13:05 — Redundancy and System Reliability: The necessity of designing autonomous systems with hardware and compute redundancies to handle critical failures like steering issues. - 16:55 — Building Community Trust: How Waymo engages with local authorities, police, and city stewards to ensure safe integration into urban environments. - 20:45 — The Impact of Generative AI: Analyzing whether the generative AI boom and reasoning models will fundamentally change the approach to autonomous driving architectures. - 24:15 — Vision-Language-Action Models: Exploring the use of VLMs and VLAs in robotics to tie together understanding, language, and physical actions. - 28:25 — The Simulation vs. Reality Gap: The challenge of building exhaustive, high-fidelity simulations that can validate complex driving recipes. - 32:10 — Predicting Actions in Environments: The technical difficulty of using simulators to envision how an environment reacts to predicted autonomous actions. ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/autonomous-vehicle-research-at-waymo/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/practical-ai/autonomous-vehicle-research-at-waymo.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.