Accueil
Équipe
Publications
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow IA recherche
Tags
Agents
ServiceNow IA recherche
Agents
Fine-Tune an SLM or Prompt an LLM? The Case of Generating Low-Code Workflows
Large Language Models (LLMs) such as GPT-4o can handle a wide range of complex tasks with the right prompt. As per token costs are …
Orlando Marquez
,
Patrice Béchard
,
Emily Chen
,
Maggie Baird
,
JingFei Chen
Knowledge Discovery and Data Mining, 2025.
Article
Citation
AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery
We introduce AgentAda, the first LLM-powered analytics agent that can learn and use new analytics skills to extract more specialized …
Amirhossein Abaskohi
,
Amrutha Ramesh
,
Shailesh Nanisetty
,
Chirag Goel
,
David Vazquez
,
Christopher Pal
,
Spandana Gella
,
Giuseppe Carenini
,
Issam H. Laradji
Workshop at the Annual Meeting of the Association for Computational Linguistics (ACL), 2025.
Article
Citation
DoomArena: A framework for Testing AI Agents Against Evolving Security Threats
We present DoomArena, a security evaluation framework for AI agents. DoomArena is designed on three principles: 1) It is a …
Léo Boisvert
,
Abhay Puri
,
Gabriel Huang
,
Mihir Bansal
,
Chandra Kiran Reddy Evuru
,
Avinandan Bose
,
Quentin Cappart
,
Maryam Fazel
,
Alexandre Lacoste
,
Alexandre Drouin
,
Jason Stanley
,
Krishnamurthy (Dj) Dvijotham
Workshop at the International Conference of Machine Learning (ICML), 2025.
Article
Citation
Code
Silent Sabotage: Injecting Backdoors into AI Agents Through Fine-Tuning
The rise of AI agents that can use tools, browse the web and interact with computers on behalf of a user, has sparked strong interest …
Léo Boisvert
,
Abhay Puri
,
Chandra Kiran Reddy Evuru
,
Joshua Kazdan
,
Avinandan Bose
,
Quentin Cappart
,
Maryam Fazel
,
Sai Rajeswar Mudumba
,
Jason Stanley
,
Nicolas Chapados
,
Alexandre Drouin
,
Krishnamurthy (Dj) Dvijotham
Workshop at the International Conference of Machine Learning (ICML), 2025.
Article
Citation
SafeArena: Evaluating the Safety of Autonomous Web Agents
LLM-based agents are becoming increasingly proficient at solving web-based tasks. With this capability comes a greater risk of misuse …
Ada Tur
,
Nicholas Meade
,
Xing Han Lu
,
Alejandra Zambrano
,
Arkil Patel
,
Esin Durmus
,
Spandana Gella
,
Karolina Stanczak
,
Siva Reddy
International Conference on Machine Learning (ICML), 2025.
Article
Citation
Code
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction
Developing autonomous agents that can navigate diverse Graphical User Interfaces (GUIs) and solve complex tasks is essential for …
Shravan Nayak
,
Xiangru Jian
,
Kevin Lin
,
Juan A. Rodriguez
,
Motek Kalsi
,
Nicolas Chapados
,
Tamer Özsu
,
Aishwarya Agrawal
,
David Vazquez
,
Christopher Pal
,
Perouz Taslakian
,
Spandana Gella
,
Sai Rajeswar Mudumba
International Conference on Machine Learning (ICML), 2025.
Article
Citation
Code
Vidéo
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
Understanding diverse web data and automating web development presents an exciting challenge for agentic AI. While existing benchmarks …
Rabiul Awal
,
Mahsa Massoud
,
Zichao Li
,
Aarash Feizi
,
Suyuchen Wang
,
Christopher Pal
,
Aishwarya Agrawal
,
David Vazquez
,
Siva Reddy
,
Juan A. Rodriguez
,
Perouz Taslakian
,
Sai Rajeswar Mudumba
Workshop at the Computer Vision and Pattern Recognition Conference (CVPR), 2025.
Article
Citation
StarVector: Generating Scalable Vector Graphics Code from Images and Text
Scalable Vector Graphics (SVGs) are vital for modern image rendering due to their scalability and versatility. Previous SVG generation …
Juan A. Rodriguez
,
Abhay Puri
,
Shubham Agarwal
,
Issam H. Laradji
,
Pau Rodriguez
,
Sai Rajeswar Mudumba
,
David Vazquez
,
Christopher Pal
,
Marco Pedersoli
Computer Vision and Pattern Recognition (CVPR), 2025.
Article
Citation
Code
Vidéo
Keeping up with dynamic attackers: Certifying robustness to adaptive online data poisoning
The rise of foundation models fine-tuned on human feedback from potentially untrusted users has increased the risk of adversarial data …
Avinandan Bose
,
Laurent Lessard
,
Maryam Fazel
,
Krishnamurthy (Dj) Dvijotham
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
Article
Citation
Code
Vidéo
A Guide To Effectively Leveraging LLMs for Low-Resource Text Summarization: Data Augmentation and Semi-supervised Approaches
Existing approaches for low-resource text summarization primarily employ large language models (LLMs) like GPT-3 or GPT-4 at inference …
Gaurav Sahu
,
Olga Vechtomova
,
Issam H. Laradji
North American Chapter of the Association for Computational Linguistics (NAACL), 2025.
Article
Citation
Code
«
»
Citation
×