About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Tags
Computer Agents
ServiceNow AI Research
Computer Agents
Grounding Computer Use Agents on Human Demonstrations
Building reliable computer-use agents requires grounding: accurately connecting natural language instructions to the correct on-screen …
Aarash Feizi
,
Shravan Nayak
,
Xiangru Jian
,
Kevin Qinghong Lin
,
Kaixin Li
,
Rabiul Awal
,
Xing Han Lu
,
Johan Obando
,
Juan A. Rodriguez
,
Nicolas Chapados
,
David Vazquez
,
Adriana Romero Soriano
,
Reihaneh Rabbany
,
Perouz Taslakian
,
Christopher Pal
,
Spandana Gella
,
Sai Rajeswar Mudumba
International Conference on Learning Representations, 2026.
PDF
Cite
Shifting AI Security to the Left: Design-Time Defenses to Mitigate the Risks of Prompt Injections
Prompt injections pose a critical weakness for modern Large Language Models, making it difficult for AI to distinguish between …
Abhay Puri
,
Kevin Kasa
,
Kiarash Mohammadi
,
Georges Belanger Albarran
,
Mihir Bansal
,
Yanick Chénard
,
Marc-Etienne Brunet
,
Jason Stanley
NOW AI, 2025.
Cite
StarUI: Learning to Ground Agentic Perception in Desktop GUIs
Desktop environments remain the blind spot of multimodal-LLM agents: unlike web or mobile, they span heterogeneous software, lack …
Aarash Feizi
,
Shravan Nayak
,
Kevin Qinghong Lin
,
Kaixin Li
,
Rabiul Awal
,
Xiangru Jian
,
Juan A. Rodriguez
,
Nicolas Chapados
,
David Vazquez
,
Reihaneh Rabbany
,
Adriana Romero Soriano
,
Perouz Taslakian
,
Christopher Pal
,
Spandana Gella
,
Sai Rajeswar Mudumba
NOW AI, 2025.
Cite
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
We present WebMMU, a multilingual benchmark that evaluates three core web tasks: (1) website visual question answering, (2) code …
Sai Rajeswar Mudumba
,
Christopher Pal
,
Perouz Taslakian
,
Spandana Gella
,
Rabiul Awal
,
Aarash Feizi
,
Mahsa Massoud
,
Zichao Li
,
Siva Reddy
,
David Vazquez
,
Suyuchen Wang
NOW AI, 2025.
Cite
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches
Existing approaches for low-resource text summarization primarily employ large language models (LLMs) like GPT-3 or GPT-4 at inference …
Gaurav Sahu
,
Olga Vechtomova
,
Issam H. Laradji
North American Chapter of the Association for Computational Linguistics (NAACL), 2025.
PDF
Cite
Cite
×