# The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI Page: https://stenobird.com/podcast/latent-space-ai-engineer/the-agent-reasoning-interface-o1-o3-claude-3-chatgpt-canvas-tasks-and-operator-with-karina-nguyen-of-openai Text version: https://stenobird.com/podcast/latent-space-ai-engineer/the-agent-reasoning-interface-o1-o3-claude-3-chatgpt-canvas-tasks-and-operator-with-karina-nguyen-of-openai.md Podcast: [Latent Space: The AI Engineer Podcast](https://stenobird.com/podcast/latent-space-ai-engineer) Published: 2025-02-01T01:43:16+00:00 Episode link: https://www.latent.space/p/karina Audio file: https://api.substack.com/feed/podcast/155459121/4ad689b3be07c7ed322e79d11a28b061.mp3 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/the-agent-reasoning-interface-o1-o3-claude-3-chatgpt-canvas-tasks-and-operator-with-karina-nguyen-of-openai Duration seconds: 4120 ## Resource Sponsorships and tickets for the AI Engineer Summit are selling fast ! See the new website with speakers and schedules live! If you are building AI agents or leading teams of AI Engineers , this will be the single highest-signal conference of the year for you, this Feb 20-22nd in NYC. We’re pleased to share that Karina will be presenting OpenAI’s closing keynote at the AI Engineer Summit. We were fortunate to get some time with her today to introduce some of her work, and hope this serves as nice background for her talk! There are very few early AI careers that have been as impactful as Karina Nguyen’s. After stints at Notion, Square, Dropbox, Primer, the New York Times, and UC Berkeley, She joined Anthropic as employee ~60 and worked on a wide range of research/product roles for Claude 1, 2, and 3. We’ll just let her LinkedIn speak for itself: Now, as Research manager and Post-training lead in Model Behavior at OpenAI, she creates new interaction paradigms for reasoning interfaces and capabilities, like ChatGPT Canvas , Tasks , SimpleQA , streaming chain-of-thought for o1 models , and more via novel synthetic model training. Ideal AI Research+Product Process In the podcast we got a sense of what Karina has found works for her and her team to be as productive as they have been: * Write PRD (Define what you want) * Funding (Get resources) * Prototype Prompted Baseline (See what’s possible) * Write and Run Evals (Get failures to hillclimb) * Model training (Exceed baseline without overfitting) * Bugbash (Find bugs and solve them) * Ship (Get users!) We could turn this into a snazzy viral graphic but really this is all it is. Simple to say, difficult to do well. Hopefully it helps you define your process if you do similar product-research work. Show Notes * Our Reasoning… ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/the-agent-reasoning-interface-o1-o3-claude-3-chatgpt-canvas-tasks-and-operator-with-karina-nguyen-of-openai/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/latent-space-ai-engineer/the-agent-reasoning-interface-o1-o3-claude-3-chatgpt-canvas-tasks-and-operator-with-karina-nguyen-of-openai.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.