Now Assist Guardian - Prompt Injection - How Does it Work?

JustinD56844010
Tera Contributor

We are enabling Now Assist in Virtual Agent.

We are also enabling Now Assist Guardian - Prompt Injection.

We are struggling to understand what ServiceNow considers a malicious prompt.
For example, if a user of the Virtual Agent in Employee Center asks it for another employee's home address, is that considered a malicious prompt?

The table that stores the logs for malicious prompts, sn_nowassist_admin_sys_gen_ai_export_data, is not easily readable, so it's hard to figure out how 'Prompt Injection' works.

2 REPLIES

Tanushree Maiti
Tera Patron
In this session we cover Now Assist Guardian, our newest feature delivering built-in guardrails for trustworthy AI. Launched in Q4 2024, Now Assist Guardian monitors and blocks offensive content and prompt injection attacks when using Now Assist generative AI products. It also covers sensitive ...
Are you worried about offensive or unsafe content sneaking into your AI experiences? In this quick episode of Did You Know, Willem Zeiler walks us through Now Assist Guardian - a built-in safeguard for AI-generated content on the Now Platform. You'll learn how to: - Enable the offensiveness filter
In this episode of TechBytes, AI Governance Product Manager at ServiceNow, Louis Philip Morin, discusses the critical aspects of trust in AI. The conversation delves into the challenges of creating AI experiences free from toxic and malicious language, highlighting the role of Now Assist Guardian
Now Assist Guardian is a built-in solution for enhancing the safety and reliability of AI powered experiences on the Now platform. Learn what it does, how to configure it, and how to analyze the data it collects. Chapters: 00:00 Now Assist Guardian features 00:56 Offensiveness guardrail 01:21

hugohuyghue
Tera Contributor

Hi @JustinD56844010,
I think the resources Tanushree Maiti suggested are very helpful and should be able to assist you.