Episode

Autonomous Vehicle Research at Waymo

Podcast: Practical AI
Published: Nov 13, 2025
Duration seconds: 3128
Processing state: processed
Canonical source: https://share.transistor.fm/s/7b9cfe10
Audio: https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/7b9cfe10/fe5031af.mp3
JSON: /v1/public/podcasts/practical-ai/episodes/autonomous-vehicle-research-at-waymo
Markdown: /podcast/practical-ai/autonomous-vehicle-research-at-waymo.md

Actions

POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/autonomous-vehicle-research-at-waymo/transcription-requests
Idempotently request low-priority transcript generation for this episode.
GET https://stenobird.com/podcast/practical-ai/autonomous-vehicle-research-at-waymo.md
Read the agent-friendly Markdown representation of this episode resource.

Summary

Waymo's VP of Research, Drago Anguelov, explains the technical duality of developing an onboard autonomous driver while simultaneously building massive-scale simulations. The discussion explores how foundation models and vision-language-action models are being integrated into the autonomy stack to improve safety and real-world performance.

Topics

Autonomous Vehicles
Foundation Models
Computer Vision
Large Language Models
Robotics
Simulation
Machine Learning
Waymo

Highlights

Main idea: Autonomous driving requires a dual-track approach: optimizing the onboard driver for real-time performance and building massive-scale simulators for validation
Practical takeaway: Large vision-language models (VLMs) are being used offboard to curate data and teach the onboard models through improved world knowledge
Failure mode: Relying solely on generative models for driving is risky; Waymo implements a 'safety harness' to validate and constrain model predictions
Technical challenge: Scaling simulation realism without exponential increases in compute costs is a critical bottleneck for verifying edge cases
Future direction: The integration of vision-language-action (VLA) models and the development of more generalizable, scalable world models

Chapters

4:45 Safety Milestones and Hardware Evolution: A look at Waymo's safety data showing a significant reduction in pedestrian incidents and the upcoming sixth-generation vehicle hardware.
8:55 Expanding Operational Domains: Discussing the expansion of autonomous services into new geographic areas and complex environments like highways.
13:05 Redundancy and System Reliability: The necessity of designing autonomous systems with hardware and compute redundancies to handle critical failures like steering issues.
16:55 Building Community Trust: How Waymo engages with local authorities, police, and city stewards to ensure safe integration into urban environments.
20:45 The Impact of Generative AI: Analyzing whether the generative AI boom and reasoning models will fundamentally change the approach to autonomous driving architectures.
24:15 Vision-Language-Action Models: Exploring the use of VLMs and VLAs in robotics to tie together understanding, language, and physical actions.
28:25 The Simulation vs. Reality Gap: The challenge of building exhaustive, high-fidelity simulations that can validate complex driving recipes.
32:10 Predicting Actions in Environments: The technical difficulty of using simulators to envision how an environment reacts to predicted autonomous actions.