# What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth Page: https://stenobird.com/podcast/gradient-dissent/what-s-the-path-to-agi-a-conversation-with-turing-co-founder-and-ceo-jonathan-siddharth Text version: https://stenobird.com/podcast/gradient-dissent/what-s-the-path-to-agi-a-conversation-with-turing-co-founder-and-ceo-jonathan-siddharth.md Podcast: [Gradient Dissent: Conversations on AI](https://stenobird.com/podcast/gradient-dissent) Published: 2024-11-07T10:00:00+00:00 Episode link: https://wandb.ai/site/resources/podcast Audio file: https://podcasts.captivate.fm/media/e2b8442f-d5bb-4169-a8c9-d96aacf9c38f/GD023-Pod.mp3 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/gradient-dissent/episodes/what-s-the-path-to-agi-a-conversation-with-turing-co-founder-and-ceo-jonathan-siddharth Duration seconds: 3288 ## Resource The bottleneck for AGI has shifted from compute and raw internet data to the need for high-quality human intelligence. Turing CEO Jonathan Siddharth explains how scaling human-annotated reasoning and coding data is the next frontier for frontier model training. ## Highlights - Main idea: The primary bottleneck for AGI progress is no longer compute, but the availability of high-quality, intelligent tokens - Main idea: Coding data acts as a catalyst for broader capabilities, improving symbolic reasoning, logic, and mathematics - Practical takeaway: Scaling human intelligence requires a global, vetted 'developer cloud' to provide specialized domain expertise at scale - Failure mode: Relying solely on synthetic data or self-play without a robust reward function or simulator may limit model generalization - Practical takeaway: Enterprise AI adoption will likely remain 'human-in-the-loop' for the foreseeable future, focusing on audit and compliance ## Topics AGI, Large Language Models, Human-in-the-loop, Synthetic Data, Software Engineering, Machine Learning Training, Reasoning Capabilities, Enterprise AI ## Chapters - 1:00 — The Shift from Compute to Intelligence: Discussion on why the AGI bottleneck has moved from hardware and raw web data to the need for high-quality human reasoning. - 5:10 — Scaling Human Expertise: How to find, vet, and match the world's smartest engineers and scientists to power model training. - 9:10 — Optimizing for Price-Performance: Leveraging global labor markets to scale the demand for high-quality training datasets. - 17:30 — Beyond Code: Reasoning and Post-Training: The importance of using coding tokens to improve symbolic reasoning, logic, and arithmetic in LLMs. - 21:35 — Expanding into New Knowledge Domains: Moving beyond software engineering into marketing, finance, and healthcare to distill human knowledge. - 34:15 — The Limits of Synthetic Data: The challenges of using models to bootstrap themselves without effective simulators or reward functions. - 42:30 — The Future of Enterprise AI: Why the next wave of enterprise AI will focus on human-in-the-loop systems and compliance-driven copilots. ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/gradient-dissent/episodes/what-s-the-path-to-agi-a-conversation-with-turing-co-founder-and-ceo-jonathan-siddharth/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/gradient-dissent/what-s-the-path-to-agi-a-conversation-with-turing-co-founder-and-ceo-jonathan-siddharth.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.