Episode

Autonomous Vehicle Research at Waymo

Podcast
Practical AI
Published
Nov 13, 2025
Duration seconds
3128
Processing state
processed
Canonical source
https://share.transistor.fm/s/7b9cfe10
Audio
https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/7b9cfe10/fe5031af.mp3
JSON
/v1/public/podcasts/practical-ai/episodes/autonomous-vehicle-research-at-waymo
Markdown
/podcast/practical-ai/autonomous-vehicle-research-at-waymo.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/autonomous-vehicle-research-at-waymo/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/practical-ai/autonomous-vehicle-research-at-waymo.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Waymo's VP of Research, Drago Anguelov, explains the technical duality of developing an onboard autonomous driver while simultaneously building massive-scale simulations. The discussion explores how foundation models and vision-language-action models are being integrated into the autonomy stack to improve safety and real-world performance.

Topics

  • Autonomous Vehicles
  • Foundation Models
  • Computer Vision
  • Large Language Models
  • Robotics
  • Simulation
  • Machine Learning
  • Waymo

Highlights

  • Main idea: Autonomous driving requires a dual-track approach: optimizing the onboard driver for real-time performance and building massive-scale simulators for validation
  • Practical takeaway: Large vision-language models (VLMs) are being used offboard to curate data and teach the onboard models through improved world knowledge
  • Failure mode: Relying solely on generative models for driving is risky; Waymo implements a 'safety harness' to validate and constrain model predictions
  • Technical challenge: Scaling simulation realism without exponential increases in compute costs is a critical bottleneck for verifying edge cases
  • Future direction: The integration of vision-language-action (VLA) models and the development of more generalizable, scalable world models

Chapters

  1. 4:45 Safety Milestones and Hardware Evolution: A look at Waymo's safety data showing a significant reduction in pedestrian incidents and the upcoming sixth-generation vehicle hardware.
  2. 8:55 Expanding Operational Domains: Discussing the expansion of autonomous services into new geographic areas and complex environments like highways.
  3. 13:05 Redundancy and System Reliability: The necessity of designing autonomous systems with hardware and compute redundancies to handle critical failures like steering issues.
  4. 16:55 Building Community Trust: How Waymo engages with local authorities, police, and city stewards to ensure safe integration into urban environments.
  5. 20:45 The Impact of Generative AI: Analyzing whether the generative AI boom and reasoning models will fundamentally change the approach to autonomous driving architectures.
  6. 24:15 Vision-Language-Action Models: Exploring the use of VLMs and VLAs in robotics to tie together understanding, language, and physical actions.
  7. 28:25 The Simulation vs. Reality Gap: The challenge of building exhaustive, high-fidelity simulations that can validate complex driving recipes.
  8. 32:10 Predicting Actions in Environments: The technical difficulty of using simulators to envision how an environment reacts to predicted autonomous actions.