# Genie: Generative Interactive Environments with Ashley Edwards - #696

- Page: https://stenobird.com/podcast/twiml-ai-podcast/genie-generative-interactive-environments-with-ashley-edwards-696
- Text version: https://stenobird.com/podcast/twiml-ai-podcast/genie-generative-interactive-environments-with-ashley-edwards-696.md
- Podcast: [The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)](https://stenobird.com/podcast/twiml-ai-podcast)
- Published: 2024-08-05T17:14:00+00:00
- Episode link: https://twimlai.com/podcast/twimlai/genie-generative-interactive-environments/
- Audio file: https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN9110516542.mp3?updated=1722879160
- Processing state: failed
- JSON: https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/genie-generative-interactive-environments-with-ashley-edwards-696
- Duration seconds: 2811

## Resource

Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Generative Interactive Environments, a system for creating ‘playable’ video environments for training deep reinforcement learning (RL) agents at scale in a completely unsupervised manner. We explore the motivations behind Genie, the challenges of data acquisition for RL, and Genie’s capability to learn world models from videos without explicit action data, enabling seamless interaction and frame prediction. Ashley walks us through Genie’s core components—the latent action model, video tokenizer, and dynamics model—and explains how these elements collaborate to predict future frames in video sequences. We discuss the model architecture, training strategies, benchmarks used, as well as the application of spatiotemporal transformers and the MaskGIT techniques used for efficient token prediction and representation. Finally, we touched on Genie’s practical implications, its comparison to other video generation models like “Sora,” and potential future directions in video generation and diffusion models.
The complete show notes for this episode can be found at https://twimlai.com/go/696.

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/genie-generative-interactive-environments-with-ashley-edwards-696/transcription-requests` — Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/twiml-ai-podcast/genie-generative-interactive-environments-with-ashley-edwards-696.md` — Read the agent-friendly Markdown representation of this episode resource.

A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.
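
As a rough illustration of the two entries under Actions, the sketch below prepares the `request_transcript` POST and the `read_markdown` GET using only Python's standard library. The URLs are copied from this page. Everything else is an assumption: the page does not say whether the POST expects a body, whether authentication is required, or what the responses look like, so the helper names (`build_request_transcript`, `build_read_markdown`, `send`, `main`) are hypothetical and the empty POST body is a guess.

```python
import urllib.request
from typing import Tuple

BASE = "https://stenobird.com"
EPISODE = "genie-generative-interactive-environments-with-ashley-edwards-696"

# Both URLs are taken verbatim from the Actions section of this page.
TRANSCRIPT_REQUEST_URL = (
    f"{BASE}/v1/public/podcasts/twiml-ai-podcast/episodes/{EPISODE}"
    "/transcription-requests"
)
MARKDOWN_URL = f"{BASE}/podcast/twiml-ai-podcast/{EPISODE}.md"


def build_request_transcript() -> urllib.request.Request:
    """Prepare the POST for request_transcript.

    Assumption: an empty body and no auth headers are acceptable;
    the page documents neither.
    """
    return urllib.request.Request(TRANSCRIPT_REQUEST_URL, data=b"", method="POST")


def build_read_markdown() -> urllib.request.Request:
    """Prepare the GET for read_markdown."""
    return urllib.request.Request(MARKDOWN_URL, method="GET")


def send(request: urllib.request.Request, timeout: float = 10.0) -> Tuple[int, str]:
    """Send a prepared request and return (HTTP status, decoded body)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = response.read().decode("utf-8", errors="replace")
        return response.status, body


def main() -> None:
    """Ask for a transcript explicitly, then read the Markdown resource."""
    status, _ = send(build_request_transcript())
    print("request_transcript status:", status)
    status, text = send(build_read_markdown())
    print("read_markdown status:", status, "characters:", len(text))
```

Calling `main()` performs the network requests. The request is built separately from the sending step so the method and URL can be inspected before anything leaves the machine, and because the page describes `request_transcript` as idempotent, repeating the POST should not queue duplicate work.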