# Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673

- Page: https://stenobird.com/podcast/twiml-ai-podcast/training-data-locality-and-chain-of-thought-reasoning-in-llms-with-ben-prystawski-673
- Text version: https://stenobird.com/podcast/twiml-ai-podcast/training-data-locality-and-chain-of-thought-reasoning-in-llms-with-ben-prystawski-673.md
- Podcast: [The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)](https://stenobird.com/podcast/twiml-ai-podcast)
- Published: 2024-02-26T19:17:00+00:00
- Episode link: https://twimlai.com/podcast/twimlai/training-data-locality-and-chain-of-thought-reasoning-in-llms/
- Audio file: https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN1845606139.mp3?updated=1708975551
- Processing state: failed
- JSON: https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/training-data-locality-and-chain-of-thought-reasoning-in-llms-with-ben-prystawski-673
- Duration seconds: 1503

## Resource

Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive science and machine learning. Our conversation centers on Ben’s recent paper, “Why think step by step? Reasoning emerges from the locality of experience,” which he recently presented at NeurIPS 2023. In this conversation, we start out exploring basic questions about LLM reasoning, including whether it exists, how we can define it, and how techniques like chain-of-thought reasoning appear to strengthen it. We then dig into the details of Ben’s paper, which aims to understand why thinking step-by-step is effective and demonstrates that local structure is the key property of LLM training data that enables it.

The complete show notes for this episode can be found at twimlai.com/go/673.

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/training-data-locality-and-chain-of-thought-reasoning-in-llms-with-ben-prystawski-673/transcription-requests`. Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/twiml-ai-podcast/training-data-locality-and-chain-of-thought-reasoning-in-llms-with-ben-prystawski-673.md`. Read the agent-friendly Markdown representation of this episode resource.

A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.
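
As a minimal sketch of how an agent might prepare the two calls listed under Actions, the Python below builds the `POST` and `GET` requests with the standard library and does not send them. The URLs and HTTP methods come from this page. The helper names, the empty `POST` body, and the absence of headers or authentication are assumptions, since the page does not document the request body or response format.

```python
import urllib.request

# URL parts copied from the Actions listed on this page.
BASE = "https://stenobird.com"
PODCAST = "twiml-ai-podcast"
EPISODE = (
    "training-data-locality-and-chain-of-thought-reasoning-in-llms"
    "-with-ben-prystawski-673"
)


def build_request_transcript() -> urllib.request.Request:
    """Prepare the request_transcript call (hypothetical helper name)."""
    url = (
        f"{BASE}/v1/public/podcasts/{PODCAST}"
        f"/episodes/{EPISODE}/transcription-requests"
    )
    # Assumption: an empty body is accepted; the page does not specify one.
    return urllib.request.Request(url, data=b"", method="POST")


def build_read_markdown() -> urllib.request.Request:
    """Prepare the read_markdown call (hypothetical helper name)."""
    url = f"{BASE}/podcast/{PODCAST}/{EPISODE}.md"
    return urllib.request.Request(url, method="GET")
```

Passing either object to `urllib.request.urlopen` would send it. Because the page describes `request_transcript` as idempotent, repeating that call should not queue duplicate work, although the response format is not documented here.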