# Powering AI with the World's Largest Computer Chip with Joel Hestness - #684 Page: https://stenobird.com/podcast/twiml-ai-podcast/powering-ai-with-the-world-s-largest-computer-chip-with-joel-hestness-684 Text version: https://stenobird.com/podcast/twiml-ai-podcast/powering-ai-with-the-world-s-largest-computer-chip-with-joel-hestness-684.md Podcast: [The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)](https://stenobird.com/podcast/twiml-ai-podcast) Published: 2024-05-13T19:58:23+00:00 Episode link: https://twimlai.com/podcast/twimlai/powering-ai-with-the-worlds-largest-computer-chip/ Audio file: https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN6928791405.mp3?updated=1715630189 Processing state: failed JSON: https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/powering-ai-with-the-world-s-largest-computer-chip-with-joel-hestness-684 Duration seconds: 3306 ## Resource Today we're joined by Joel Hestness, principal research scientist and lead of the core machine learning team at Cerebras. We discuss Cerebras’ custom silicon for machine learning, Wafer Scale Engine 3, and how the latest version of the company’s single-chip platform for ML has evolved to support large language models. Joel shares how WSE3 differs from other AI hardware solutions, such as GPUs, TPUs, and AWS’ Inferentia, and talks through the homogenous design of the WSE chip and its memory architecture. We discuss software support for the platform, including support by open source ML frameworks like Pytorch, and support for different types of transformer-based models. Finally, Joel shares some of the research his team is pursuing to take advantage of the hardware's unique characteristics, including weight-sparse training, optimizers that leverage higher-order statistics, and more. The complete show notes for this episode can be found at twimlai.com/go/684. ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/powering-ai-with-the-world-s-largest-computer-chip-with-joel-hestness-684/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/twiml-ai-podcast/powering-ai-with-the-world-s-largest-computer-chip-with-joel-hestness-684.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.