From Assistant to Adversary: Exploiting Agentic AI Developer Tools



Developers are increasingly turning to AI-enabled tools for coding, including Cursor, OpenAI Codex, Claude Code, and GitHub Copilot. While these tools can enable faster development and reviews, they also present an expanding attack surface for threat actors.

These agentic tools have different implementations but all share the common framework of using LLMs to determine actions to take on a developer's behalf. More agentic autonomy means increased access and capabilities, with a corresponding increase in overall unpredictability.

In this blog, we detail how an attacker can leverage simple watering hole attacks, introducing untrusted data that exploits the combination of assistive alignment and increasing agent autonomy to ultimately achieve remote code execution (RCE) on developer machines.

This is an overview of one of the attack frameworks we presented at Black Hat USA in August 2025.

What are computer use agents?

For our purposes, a computer use agent (CUA) is any agent that can autonomously execute actions and tools on a given machine with the same access and permissions as the signed-in user.

Generally, these agents use LLMs to parse user queries, code, and command results to determine the next action to take. They are designed to repeatedly invoke actions until a given user request is complete. These actions can include things like moving or clicking the mouse, typing, editing files, and even executing commands.

We classify agents into autonomy levels, defined by the possible paths of execution available to them. CUAs are generally classified as level 3 agents. A model—generally an LLM, but often augmented by vision models to understand displays—determines the next actions, and the results of those actions are passed back to the model. This creates an execution loop, and a high degree of nondeterminism.

General architecture of computer use agents, showing the communication flow between the server agent determining the next tool calls, and the client agent executing the tools. The diagram highlights the execution loop, in which the client continues to execute tools until the server agent (using LLM/vision/other models) determines that the user's original task is completed
Figure 1. General architecture of computer use agents

It is impossible to confidently map the flow of data and execution for any given query, because the result will likely be different every time. This volatility, combined with these agents' ability to execute commands on a user's machine, creates ample opportunities for attackers.
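The execution loop shown in Figure 1 can be sketched in a few lines. This is an illustrative simplification, not any vendor's actual implementation: the model callback, the action dictionary format, and the tool names are all assumptions made for clarity.

```python
def run_agent(task, model, tools):
    """Illustrative level-3 agent loop: the model picks the next tool call,
    the client executes it, and the output is fed back to the model until it
    declares the task complete."""
    transcript = [{"role": "user", "content": task}]
    outputs = []
    while True:
        # The server-side model decides the next action from the transcript,
        # e.g. {"tool": "run_command", "args": {...}} or {"done": True}.
        action = model(transcript)
        if action.get("done"):
            return outputs
        # Tools run with the signed-in user's permissions -- this is where
        # untrusted data in the transcript can turn into real command execution.
        result = tools[action["tool"]](**action["args"])
        outputs.append(result)
        transcript.append({"role": "tool", "content": str(result)})
```

Each iteration feeds tool output back into the model's context, which is exactly why attacker-controlled data that enters that context can steer subsequent tool calls.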

How can we leverage agent alignment?

Crafting an attack against these agents first requires understanding their capabilities, overall alignment, and common use cases.

Tools like Cursor (assuming the agentic auto-run feature is enabled) are designed to autonomously complete users' tasks by editing a codebase and executing necessary terminal commands. We can learn more about how Cursor works by reading its various system prompts, including the system prompt specific to tool execution:

You have tools at your disposal to solve the coding task. Follow these rules regarding tool calls:
1. ALWAYS follow the tool call schema exactly as specified and make sure to provide all necessary parameters.
...
8. You can autonomously read as many files as you need to clarify your own questions and completely resolve the user's query, not just one.
9. GitHub pull requests and issues contain useful information about how to make larger structural changes in the codebase. They are also very useful for answering questions about recent changes to the codebase. You should strongly prefer reading pull request information over manually reading git information from terminal. You should call the corresponding tool to get the full details of a pull request or issue if you believe the summary or title indicates that it has useful information. Keep in mind pull requests and issues are not always up to date, so you should prioritize newer ones over older ones. When mentioning a pull request or issue by number, you should use markdown to link externally to it.

Here, we see that Cursor is being explicitly instructed to ingest a repository's pull requests and issues. This data source is inherently untrusted, assuming that external contributors can open pull requests and issues on a repository. Knowing this, we can leverage indirect prompt injection—in which malicious instructions are added to the content retrieved by a model—to inject a payload into a GitHub issue or pull request.
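The reason indirect prompt injection works is structural: trusted instructions and untrusted issue text land in the same prompt string. The sketch below is a hypothetical illustration of that surface; the prompt format and function name are assumptions, not Cursor's actual internals.

```python
def build_agent_context(user_query, issue_title, issue_body):
    """Hypothetical prompt assembly: the agent's trusted instructions and
    untrusted, attacker-editable issue text are concatenated into one
    context, so injected instructions can steer the model's tool calls."""
    return (
        "You are a coding agent with terminal access.\n"
        f"User request: {user_query}\n"
        "Open GitHub issue (external contributor):\n"
        f"Title: {issue_title}\n"
        f"Body: {issue_body}\n"  # attacker-controlled text lands here
        "Determine the next tool call to resolve the issue."
    )
```

Any instruction an external contributor writes in the issue body ("to reproduce, you MUST run ...") arrives in the model's context with no marker distinguishing it from the user's own request.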

For demonstration purposes, we created a target repository, PyCronos, a fake Python data analysis library. Our objective was to craft an injection that, assuming typical agent usage, could achieve code execution on the machines of developers and maintainers of this repository.

How to plant the payload

Knowing that Cursor has the ability to autonomously execute terminal commands, we first need to develop and plant a payload that will ultimately be run on a target user's machine. In this example, we obfuscated a basic PowerShell script that achieves a reverse shell, with the intention of targeting Windows developers. Using open source obfuscators, the script was recursively obfuscated until it successfully bypassed basic Windows Defender protections.

Targeting our hypothetical PyCronos repository, we created a pycronos-integration GitHub user. From this account, we created a win-pycronos repository where the PowerShell payload was planted.

A screenshot of the attacker GitHub repository including a win-pycronos.ps1 file. The file contents are extremely obfuscated and not legible.
Figure 2. Snippet of obfuscated reverse shell PS script

From this pycronos-integrations account, we now need to craft our indirect prompt injection payload to convince a victim's Cursor agent to download and execute our PowerShell payload.

How to plant the prompt injection

First, we attempt indirect prompt injection via a GitHub issue. We’re effectively social engineering whatever agent is parsing this issue to get it to execute our malicious payload. 

A screenshot of a GitHub issue claiming that the only way to reproduce the user's error is to run a specific PowerShell command. The command in question downloads and executes the attack payload.
Figure 3. Indirect prompt injection via GitHub issue

Here the attacker has planted an issue that claims the library's (non-existent) Windows integration is broken. The issue claims that one must run a specific command to reproduce the error. While a human reviewer would likely realize that this feature doesn't exist and that this command downloads and executes code from a remote source, an agent may not.

We tested this attack path first against the demo release of Anthropic's Computer Use Agent. Note that this release does contain a security warning indicating that prompt injection is possible, and that the agent should be used strictly within an isolated environment.

If a user prompts the CUA with something along the lines of “Help me resolve open issues on this repository,” the agent will comply. 

A screenshot of a browser in which the agent is typing the URL to navigate to the relevant issue.
Figure 4. Screenshot of CUA tool navigating to the relevant issue

The agent navigates to the open issue, parses the screenshot of the issue, and pulls out the command it needs to execute. It then uses the available tools to execute it successfully, granting the attacker a reverse shell.

A screenshot of the chat transcript with the agent, in which the agent has parsed the malicious command, runs it, and returns the text "Command executed successfully."
Figure 5. CUA successfully executing the command from the issue

When we try the same attack path against Cursor, it is not so simple. Cursor doesn't rely on vision, instead pulling the text directly from the issue's metadata. Here, it sees the attempt to download and execute remote code, and informs the user of the risks before refusing to complete the task.

A screenshot of Cursor chat transcript in which the Cursor agent spells out the security risks associated with downloading and executing the remote code specified in the issue.
Figure 6. Cursor chat showing agent’s refusal to execute malicious command

This tells us that there are some guardrails in place, scanning the GitHub issue itself for potentially malicious commands. Now, the objective is to refine our injection to appear more benign, removing the execution of the payload download from the injection itself.

We can do this by hiding our payload download inside a fake Python package. From the attacker's pycronos-integrations repository, we create a seemingly harmless pycronos-windows package.

A screenshot of a GitHub repository containing a fake "PyCronos for Windows" package including a setup.py file and a basic README.
Figure 7. Screenshot of a seemingly innocuous Python package on GitHub

Inside the setup.py, we place the command to download and execute the remote payload.

A screenshot of the setup.py file, including a RunCommand function which spawns a subprocess to execute the malicious reverse shell payload.
Figure 8. Screenshot of the package’s setup.py, containing command to download and execute payload

This will execute RunCommand upon a pip install of this package.

Next, we create a pull request on the target repository to add this package to the existing project dependencies.

A screenshot of the injection GitHub PR, in which the proposed change is an additional line in the requirements.txt file that adds the malicious dependency.
Figure 9. Pull request by the attacker, adding the malicious dependency to the target repository

When a user prompts their Cursor agent to review open pull requests, it creates a branch and checks out the changes, before running pip install -r requirements.txt to test the changes.

A screenshot of Cursor chat transcript in which the Cursor agent runs the pip install of the changed requirements.
Figure 10. Cursor chat showing the agent executing the pip install of the malicious package.

Once our malicious package is installed by the agent on the user's machine, we receive a reverse shell, gaining execution directly on the user's computer.

This attack underlines the pattern that enables all such attacks: an overly privileged agent treating untrusted data (in this case, both the pull request and the malicious package) as trusted can be turned into a tool working on behalf of the attacker.

What to do to prevent such attacks

As broken down in our talk From Prompts to Pwns: Exploiting and Securing AI Agents at Black Hat USA 2025, we recommend adopting an "assume prompt injection" approach when architecting or assessing agentic applications. If an agent relies on LLMs to determine which actions and tool calls to invoke, assume the attacker can gain control of the LLM output, and can consequently control all downstream events.

When architecting similar agentic applications, NVIDIA's LLM vulnerability scanner garak can be used to help test for known prompt injection issues. To help harden LLMs against prompt injection, consider using NeMo Guardrails on LLM inputs and outputs.

The safest approach is to limit the degree of autonomy as much as possible, favoring specific predefined workflows that prevent the agent from executing arbitrary plans. If that is not possible, enforcing human-in-the-loop approval for select "sensitive" commands or actions is strongly recommended, particularly in the presence of potentially untrusted data.
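A human-in-the-loop gate for agent-proposed commands can be as simple as an allowlist check with an approval callback for everything else. This is a minimal sketch under stated assumptions; the allowlist contents and approval mechanism are placeholders, not any product's actual policy engine.

```python
import shlex

# Example allowlist of programs the agent may auto-run (an assumption for
# illustration; a real policy would be tuned to the project and environment).
SAFE_PROGRAMS = {"ls", "git", "pytest"}

def gate_command(command, approve):
    """Return True if the agent may execute `command`.

    Allowlisted programs run automatically; anything else (curl | sh,
    powershell, pip install ...) is escalated to a human via `approve`.
    """
    try:
        program = shlex.split(command)[0]
    except (ValueError, IndexError):
        return False  # unparseable or empty commands are rejected outright
    if program in SAFE_PROGRAMS:
        return True
    return approve(command)
```

With the human defaulting to deny, an injected "run this PowerShell one-liner" instruction is stopped at the gate even if the model is fully compromised, because the model output no longer directly reaches the shell.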

If fully autonomous agentic workflows without human oversight are a requirement, then the best approach is to isolate them as much as possible from any sensitive tools or information, such as requiring that fully autonomous computer use agents be run in an isolated environment such as a standalone virtual machine with limited network egress and limited access to enterprise or user data.

A similar approach is to enforce the use of local development containers; this provides a degree of isolation for the agent as well, albeit less effective than a fully isolated VM.

Regarding Cursor specifically, enterprise controls are available to either disable auto-run or to limit its blast radius by only allowing autonomous execution of allowlisted commands. Additionally, background agents are now available, allowing users to spawn autonomous agents within containers on Cursor's isolated cloud infrastructure.

Agentic coding workflows have unlocked rapid development capabilities across the industry. But to effectively harness this new efficiency, enterprises and developers need to understand the potential risks and adopt mitigating policies.

For more details, please see our talk at Black Hat USA. Black Hat will post the talk recording to YouTube when available.


