Episode
Along The Edge e3: Breaking AI Agents: From Jailbreaks to MCP Exploits with Javi Rivera
- Published
- Feb 13, 2026
- Duration seconds
- 3361
- Processing state
processed- Canonical source
- https://share.transistor.fm/s/d219fca3
Actions
POST https://stenobird.com/v1/public/podcasts/along-the-edge-agentic-ai/episodes/along-the-edge-e3-breaking-ai-agents-from-jailbreaks-to-mcp-exploits-with-javi-rivera/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/along-the-edge-agentic-ai/along-the-edge-e3-breaking-ai-agents-from-jailbreaks-to-mcp-exploits-with-javi-rivera.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
AI agents introduce a new attack surface where traditional vulnerabilities like SQL injection and XSS are weaponized through model capabilities. This episode explores how the integration of tools and web access transforms prompt injection into a high-impact supply chain risk.
Topics
- AI Security
- Agentic AI
- Prompt Injection
- Model Context Protocol
- Penetration Testing
- LLM Vulnerabilities
- Cybersecurity
- Supply Chain Attacks
Highlights
- Main idea: Agentic AI security is less about breaking the LLM and more about exploiting the tools and environments the agent can access
- Failure mode: Using unverified MCP servers creates a massive supply chain risk by allowing malicious code execution through trusted agent workflows
- Practical takeaway: Defense requires a 'defense in depth' approach, including sandboxing, least privilege, and strict input/output sanitization
- Technical distinction: Security researchers must differentiate between manipulating the model's weights/logic and manipulating the agent's tool-use capabilities
- Future threat: The rise of autonomous, thinking bots will enable automated, large-scale offensive campaigns that can adapt to defenses in real-time
Chapters
1:00The Evolution of Offensive Security: Javi Rivera discusses his transition from traditional network and web application penetration testing to focusing on the vulnerabilities within agentic workflows.5:05Classic Exploits in an Agentic World: An analysis of how SQL injection, XSS, and command injection remain viable threats when agents are granted access to external APIs and web tools.13:15System Prompts and Constraints: How developers use system prompts to define agent boundaries and how attackers attempt to bypass these instructions to manipulate agent behavior.21:35Data Exfiltration Techniques: Exploring how attackers use structured patterns to trick models into leaking sensitive information like credentials or internal file contents.30:10Indirect Prompt Injection Demo: A walkthrough of the Gray Swan AI Arena, demonstrating how an attacker can manipulate an agent by poisoning the data sources it retrieves.42:50The MCP Supply Chain Risk: A critical look at the security implications of the Model Context Protocol (MCP) and the dangers of connecting agents to unverified third-party servers.47:20Defensive Strategies and Best Practices: Practical advice on implementing guardrails, sanitizing tool outputs, and maintaining strict access controls to mitigate agentic risks.