# SWE-bench & SWE-agent | Data Brew | Episode 44

Page: https://stenobird.com/podcast/data-brew-by-databricks/swe-bench-swe-agent-data-brew-episode-44
Text version: https://stenobird.com/podcast/data-brew-by-databricks/swe-bench-swe-agent-data-brew-episode-44.md
Podcast: [Data Brew by Databricks](https://stenobird.com/podcast/data-brew-by-databricks)
Published: 2025-04-17T14:00:00+00:00
Episode link: https://www.buzzsprout.com/1370119/episodes/16876013-swe-bench-swe-agent-data-brew-episode-44.mp3
Audio file: https://www.buzzsprout.com/1370119/episodes/16876013-swe-bench-swe-agent-data-brew-episode-44.mp3
Processing state: processed
JSON: https://stenobird.com/v1/public/podcasts/data-brew-by-databricks/episodes/swe-bench-swe-agent-data-brew-episode-44
Duration seconds: 2182

## Resource

In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhancing AI in software engineering. Highlights include: - SWE-bench: A benchmark for assessing AI models on real-world coding tasks. - Addressing data leakage concerns in GitHub-sourced benchmarks. - SWE-agent: An AI-driven system for navigating and solving coding challenges. - Ov...

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/data-brew-by-databricks/episodes/swe-bench-swe-agent-data-brew-episode-44/transcription-requests` — Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/data-brew-by-databricks/swe-bench-swe-agent-data-brew-episode-44.md` — Read the agent-friendly Markdown representation of this episode resource.

A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.