Episode

Autonomous Organizations: Vending Bench & Beyond, w/ Lukas Petersson & Axel Backlund of Andon Labs

Podcast
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Published
Aug 16, 2025
Duration seconds
6523
Processing state
processed
Canonical source
https://www.cognitiverevolution.ai
Audio
https://pdst.fm/e/mgln.ai/e/1113/pscrb.fm/rss/p/traffic.megaphone.fm/RINTP8503666800.mp3?updated=1755363378
JSON
/v1/public/podcasts/the-cognitive-revolution/episodes/autonomous-organizations-vending-bench-beyond-w-lukas-petersson-axel-backlund-of-andon-labs
Markdown
/podcast/the-cognitive-revolution/autonomous-organizations-vending-bench-beyond-w-lukas-petersson-axel-backlund-of-andon-labs.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/the-cognitive-revolution/episodes/autonomous-organizations-vending-bench-beyond-w-lukas-petersson-axel-backlund-of-andon-labs/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/the-cognitive-revolution/autonomous-organizations-vending-bench-beyond-w-lukas-petersson-axel-backlund-of-andon-labs.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Today Lukas Petersson and Axel Backlund of Andon Labs join The Cognitive Revolution to discuss their experiments deploying autonomous AI agents to run real-world vending machines, exploring the safety challenges and unexpected behaviors that emerge when frontier models like Claude and Grok operate without human oversight. Read transcript of the episode here. Check out our sponsors: Oracle Cloud Infrastructure, Shopify. Shownotes below brought to you by Notion AI Meeting Notes - try one month for free at ⁠https://⁠⁠notion.com/lp/nathan Autonomous Organization Philosophy: Andon Labs believes that AI models will improve to the point where human oversight becomes impractical due to efficiency constraints, leading them to pursue fully autonomous systems rather than gradual automation. Vending Bench as a Testing Ground: They created "Vending Bench" as a benchmark for testing long-term coherence of autonomous agents, using vending machines as a practical business case for experimentation. Domain-Specific vs General AI: There's a notable difference between optimizing AI for narrow domains (like vending machines) versus general-purpose AI, with domain-specific applications potentially being more manageable regarding reward hacking. Frontier Model Race: Major companies like OpenAI and Google are advancing rapidly in general reasoning capabilities (e.g., IMO Gold achievements) independent of narrow application research. Insurance and Liability: The insurance industry may play a significant role in AI adoption, with premiums potentially being much higher for general models that could be misused versus narrow-domain models with limited capabilities. For-profit AI Safety: The case for for-profit companies in AI safety has been historically neglected but is becoming clearer, with acc…