# Open Operator, Serverless Browsers and the Future of Computer-Using Agents Page: https://stenobird.com/podcast/latent-space-ai-engineer/open-operator-serverless-browsers-and-the-future-of-computer-using-agents Text version: https://stenobird.com/podcast/latent-space-ai-engineer/open-operator-serverless-browsers-and-the-future-of-computer-using-agents.md Podcast: [Latent Space: The AI Engineer Podcast](https://stenobird.com/podcast/latent-space-ai-engineer) Published: 2025-02-28T18:31:10+00:00 Episode link: https://www.latent.space/p/browserbase Audio file: https://api.substack.com/feed/podcast/158069581/7563f30a23fe8c6f589d9f9688d7e035.mp3 Processing state: processed JSON: https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/open-operator-serverless-browsers-and-the-future-of-computer-using-agents Duration seconds: 3693 ## Resource Today's episode is with Paul Klein, founder of Browserbase. We talked about building browser infrastructure for AI agents, the future of agent authentication, and their open source framework Stagehand. * [00:00:00] Introductions * [00:04:46] AI-specific challenges in browser infrastructure * [00:07:05] Multimodality in AI-Powered Browsing * [00:12:26] Running headless browsers at scale * [00:18:46] Geolocation when proxying * [00:21:25] CAPTCHAs and Agent Auth * [00:28:21] Building “User take over” functionality * [00:33:43] Stagehand: AI web browsing framework * [00:38:58] OpenAI's Operator and computer use agents * [00:44:44] Surprising use cases of Browserbase * [00:47:18] Future of browser automation and market competition * [00:53:11] Being a solo founder Transcript Alessio [00:00:04]: Hey everyone, welcome to the Latent Space podcast. This is Alessio, partner and CTO at Decibel Partners , and I'm joined by my co-host Swyx, founder of Smol.ai . swyx [00:00:12]: Hey, and today we are very blessed to have our friends, Paul Klein, for the fourth, the fourth, CEO of Browserbase. Welcome. Paul [00:00:21]: Thanks guys. Yeah, I'm happy to be here. I've been lucky to know both of you for like a couple of years now, I think. So it's just like we're hanging out, you know, with three ginormous microphones in front of our face. It's totally normal hangout. swyx [00:00:34]: Yeah. We've actually mentioned you on the podcast, I think, more often than any other Solaris tenant. Just because like you're one of the, you know, best performing, I think, LLM tool companies that have started up in the last couple of years. Paul [00:00:50]: Yeah, I mean, it's been a whirlwind of a year, like Browserbase is actually pretty close to our first birthday. So we are one years old. And going fr… ## Actions - request_transcript: `POST https://stenobird.com/v1/public/podcasts/latent-space-ai-engineer/episodes/open-operator-serverless-browsers-and-the-future-of-computer-using-agents/transcription-requests` — Idempotently request low-priority transcript generation for this episode. - read_markdown: `GET https://stenobird.com/podcast/latent-space-ai-engineer/open-operator-serverless-browsers-and-the-future-of-computer-using-agents.md` — Read the agent-friendly Markdown representation of this episode resource. A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed. ## Transcript Full transcripts are not published on public pages unless there is a clear rights basis.