{"podcast":{"title":"Latent Space: The AI Engineer Podcast","slug":"latent-space-ai-engineer","podcast_index_feed_id":6058902,"rss_url":"https://api.substack.com/feed/podcast/1084089.rss","website_url":"https://www.latent.space/podcast","image_url":"https://substackcdn.com/feed/podcast/1084089/ca7468da5614a246d2906ee8926f6de7.jpg","author":"Latent.Space","episode_count":204,"summary":"The AI Engineer newsletter + Top technical AI podcast. How leading labs build Agents, Models, Infra, & AI for Science. See https://latent.space/about for highlights from Greg Brockman, Andrej Karpathy, George Hotz, Simon Willison, Soumith Chintala et al!","last_synced_at":null,"page_url":"https://stenobird.com/podcast/latent-space-ai-engineer"},"episode":{"title":"SF Compute: Commoditizing Compute to solve the GPU Bubble forever","slug":"sf-compute-commoditizing-compute-to-solve-the-gpu-bubble-forever","published_at":"2025-04-11T19:22:00+00:00","page_url":"https://stenobird.com/podcast/latent-space-ai-engineer/sf-compute-commoditizing-compute-to-solve-the-gpu-bubble-forever","show_page_url":"https://stenobird.com/podcast/latent-space-ai-engineer","url":"https://www.latent.space/p/sfcompute","audio_url":"https://api.substack.com/feed/podcast/160956446/0e916bdb91f24e33561bffa51f5bc611.mp3","summary":"We are calling for the world’s best AI Engineer talks for AI Architects, /r/localLlama, Model Context Protocol (MCP), GraphRAG, AI in Action, Evals, Agent Reliability, Reasoning and RL, Retrieval/Search/RecSys, Security, Infrastructure, Generative Media, AI Design & Novel AI UX, AI Product Management, Autonomy, Robotics, and Embodied Agents, Computer-Using Agents (CUA), SWE Agents, Vibe Coding, Voice, Sales/Support Agents at AIEWF 2025! Fill out the 2025 State of AI Eng survey for $250 in Amazon cards and see you from Jun 3-5 in SF! CoreWeave’s now-successful IPO has led to a lot of questions about the GPU Neocloud market, which Dylan Patel has written extensively about on SemiAnalysis.
Understanding markets requires an interesting mix of technical and financial expertise, so this will be a different kind of episode than our usual LS domain. When we first published $2 H100s: How the GPU Rental Bubble Burst, we got 2 kinds of reactions on Hacker News: * “Ah, now the AI bubble is imploding!” * “Duh, this is how it works in every GPU cycle, are you new here?” We don’t think either reaction is quite right. Specifically, it is not normal for the prices of one of the world’s most important resources right now to swing from $1 to $8 per hour based on drastically inelastic demand AND supply curves - from 3-year lock-in contracts to stupendously competitive over-ordering dynamics for NVIDIA allocations — especially with increasing baseline compute needed for even the simplest academic ML research and for new AI startups getting off the ground. We’re fortunate today to have Evan Conrad, CEO of SFCompute, one of the most exciting GPU marketplace startups, talk us through his theory of the economics of GPU markets, and why he thinks CoreWeave and Modal are well positioned, b…","meta_description":"We are calling for the world’s best AI Engineer talks for AI Architects, /r/localLlama, Model Context Protocol (MCP), GraphRAG, AI in Action, Evals, Agent…","key_points":[],"chapters":[],"topics":[],"duration_seconds":4321,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/sf-compute-commoditizing-compute-to-solve-the-gpu-bubble-forever/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/latent-space-ai-engineer/sf-compute-commoditizing-compute-to-solve-the-gpu-bubble-forever.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}