Abstract
Prompt injection attacks exploit a key vulnerability in modern large language models: their inability to reliably distinguish trusted instructions from untrusted, potentially malicious content. This vulnerability has significant implications for customers using AI on the ServiceNow platform. We present an end-to-end Automated Red Teaming pipeline, along with a case study on Security Operations (SecOps) Now Assist. The case study shows how prompt injections discovered by our tool can manipulate AI recommendations in SecOps, and includes examples of manipulated phishing incidents that demonstrate the vulnerability in production settings.
Authors: Applied Research Scientists, a Machine Learning Engineer, an AI Developer, and the Head of AI Research Deployment, all with the AI Research Deployment team (Montreal, QC, Canada; Toronto, ON, Canada; Santa Clara, CA, USA; remote, QC, Canada).