{"podcast":{"title":"The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)","slug":"twiml-ai-podcast","podcast_index_feed_id":1045879,"rss_url":"https://feeds.megaphone.fm/MLN2155636147","website_url":"https://twimlai.com","image_url":"https://megaphone.imgix.net/podcasts/35230150-ee98-11eb-ad1a-b38cbabcd053/image/TWIML_AI_Podcast_Official_Cover_Art_1400px.png?ixlib=rails-4.3.1&max-w=3000&max-h=3000&fit=crop&auto=format,compress","author":"TWIML","episode_count":785,"summary":"Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, deep learning, natural language processing, neural networks, analytics, computer science, data science and more.","last_synced_at":null,"page_url":"https://stenobird.com/podcast/twiml-ai-podcast"},"episode":{"title":"Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743","slug":"genie-3-a-new-frontier-for-world-models-with-jack-parker-holder-and-shlomi-fruchter-743","published_at":"2025-08-19T17:57:00+00:00","page_url":"https://stenobird.com/podcast/twiml-ai-podcast/genie-3-a-new-frontier-for-world-models-with-jack-parker-holder-and-shlomi-fruchter-743","show_page_url":"https://stenobird.com/podcast/twiml-ai-podcast","url":"https://twimlai.com/podcast/twimlai/genie-3-a-new-frontier-for-world-models/","audio_url":"https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN4297409814.mp3?updated=1755626878","summary":"Google DeepMind researchers discuss Genie 3, a generative world model capable of creating interactive, playable virtual environments from text and video prompts. The discussion explores the technical leap from static video generation to real-time, consistent, and promptable simulated worlds.","meta_description":"Explore the architecture and future of Genie 3, Google DeepMind's new world model for generating interactive, high-resolution virtual environments.","key_points":["Main idea: Genie 3 represents a 100x improvement in resolution, duration, and generation speed over its predecessor","Technical breakthrough: The integration of text-to-video capabilities allows for highly compressed, semantic control over world generation","Core challenge: Maintaining visual and temporal consistency when the camera moves or the user interacts with the environment","Practical takeaway: World models like Genie 3 can serve as dynamic, scalable training environments for embodied AI agents","Future vision: Using generative worlds for personalized education, psychological exposure therapy, and complex human-agent interaction simulations"],"chapters":[{"start_ms":60000,"title":"Introduction to Genie 3","summary":"A look back at the evolution of the Genie project and the scale of improvements in the new model."},{"start_ms":590000,"title":"The Value of World Models","summary":"Discussing why generative world models are a powerful alternative to traditional distributed reinforcement learning."},{"start_ms":1155000,"title":"Architectural Breakthroughs","summary":"How leveraging text-to-video research enabled the transition from static images to interactive environments."},{"start_ms":1680000,"title":"Achieving Visual Consistency","summary":"The technical difficulty of ensuring the world remains stable during camera movement and user input."},{"start_ms":1965000,"title":"Prompting with Video","summary":"Exploring the 'inception' capability where the model can be prompted using existing video content."},{"start_ms":2535000,"title":"Promptable World Events","summary":"How users can use text to trigger specific behaviors or changes within the generated environment."},{"start_ms":3335000,"title":"The Future of Embodied AI","summary":"Using generative worlds to train agents to interact with humans and physical objects in realistic scenarios."}],"topics":["Genie 3","World Models","Google DeepMind","Generative AI","Embodied AI","Reinforcement Learning","Computer Vision","Interactive Simulation"],"duration_seconds":3661,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/genie-3-a-new-frontier-for-world-models-with-jack-parker-holder-and-shlomi-fruchter-743/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/twiml-ai-podcast/genie-3-a-new-frontier-for-world-models-with-jack-parker-holder-and-shlomi-fruchter-743.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}