ServiceNow recherche

DoomArena: A framework for Testing AI Agents Against Evolving Security Threats

Résumé

We present DoomArena, a security evaluation framework for AI agents. DoomArena is designed on three principles: 1) It is a \emph{plug-in} framework and integrates easily into realistic agentic frameworks like BrowserGym (for web agents) and $\tau$-bench (for tool calling agents); 2) It is \emph{configurable} and allows for detailed threat modeling, allowing configuration of specific components of the agentic framework being attackable, and specifying targets for the attacker; and 3) It is \emph{modular} and decouples the development of attacks from details of the environment in which the agent is deployed, allowing for the same attacks to be applied across multiple environments. We illustrate several advantages of our framework, including enabling the development of generic attacker agents, the ability to easily combine several previously published attacks to enable comprehensive and fine-grained security testing, and the ability to analyze trade-offs between various vulnerabilities. We apply DoomArena to state-of-the-art (SOTA) web and tool-calling agents and find a number of surprising results: 1) SOTA agents have varying levels of vulnerability to different threat models (malicious user vs malicious environment), and there is no Pareto dominant agent across all threat models; 2) When multiple attacks are applied to an agent, they often combine constructively; 3) Guardrail model-based defenses seem to fail, while defenses based on powerful SOTA LLMs work much better.

Publication
Workshop at the International Conference of Machine Learning (ICML)
Léo	Boisvert
Léo Boisvert
Visiting Researcher

Visiting Researcher at AI Frontier Research located at Montreal, QC, Canada.

Abhay Puri
Abhay Puri
Applied Research Scientist

Applied Research Scientist at AI Research Deployment​ located at Montreal, QC, Canada.

Gabriel Huang
Gabriel Huang
Research Scientist

Research Scientist at AI Frontier Research located at Montreal, QC, Canada.

Mihir Bansal
Mihir Bansal
Machine Learning Engineer

Machine Learning Engineer at AI Research Deployment​ located at Santa Clara, CA, USA.

Alexandre Lacoste
Alexandre Lacoste
Research Lead

Research Lead at AI Frontier Research located at Montreal, QC, Canada.

Alexandre Drouin
Alexandre Drouin
Head of AI Frontier Research​

Head of AI Frontier Research​ at AI Frontier Research located at Montreal, QC, Canada.

Jason Stanley
Jason Stanley
Head of AI Research Deployment​

Head of AI Research Deployment​ at AI Research Deployment​ located at Montreal, QC, Canada.