About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Tags
Web Agents
ServiceNow AI Research
Web Agents
The BrowserGym Ecosystem for Web Agent Research
The BrowserGym ecosystem addresses the growing need for efficient evaluation and benchmarking of web agents, particularly those …
Thibault Le Sellier De Chezelles
,
Maxime Gasse
,
Alexandre Drouin
,
Massimo Caccia
,
Léo Boisvert
,
Megh Thakkar
,
Tom Marty
,
Rim Assouel
,
Sahar Omidi Shayegan
,
Siva Reddy
,
Quentin Cappart
,
Graham Neubig
,
Nicolas Chapados
,
Alexandre Lacoste
Transactions on Machine Learning Research (TMLR), 2025.
PDF
Cite
AgentMerge: Enhancing Generalization in Fine-Tuned LLM Agents
Recent advancements in large language models (LLMs) have spurred interest in developing autonomous agents capable of performing complex …
Megh Thakkar
,
Léo Boisvert
,
Thibault Le Sellier De Chezelles
,
Alexandre Piche
,
Maxime Gasse
,
Alexandre Lacoste
,
Massimo Caccia
Workshop at the Neural Information Processing Systems (NeurIPS), 2024.
PDF
Cite
Fine-Tuning Web Agents: It Works, But It's Trickier Than You Think
Recent advancements in large language models (LLMs) have sparked interest in developing autonomous web agents capable of performing …
Massimo Caccia
,
Megh Thakkar
,
Léo Boisvert
,
Thibault Le Sellier De Chezelles
,
Alexandre Piche
,
Nicolas Chapados
,
Alexandre Drouin
,
Maxime Gasse
,
Alexandre Lacoste
Workshop at the Neural Information Processing Systems (NeurIPS), 2024.
PDF
Cite
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
The ability of large language models (LLMs) to mimic human-like intelligence has led to a surge in LLM-based autonomous agents. Though …
Léo Boisvert
,
Megh Thakkar
,
Maxime Gasse
,
Massimo Caccia
,
Thibault Le Sellier De Chezelles
,
Quentin Cappart
,
Nicolas Chapados
,
Alexandre Lacoste
,
Alexandre Drouin
NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.
PDF
Cite
Video
Fine-Tuning Web Agents: It Works, But It's Trickier Than You Think
Recent advancements in large language models (LLMs) have sparked interest in developing autonomous web agents capable of performing …
Massimo Caccia
,
Megh Thakkar
,
Léo Boisvert
,
Thibault Le Sellier De Chezelles
,
Alexandre Piche
,
Nicolas Chapados
,
Alexandre Drouin
,
Maxime Gasse
,
Alexandre Lacoste
NOW AI Conference (NOWAI), 2024.
PDF
Cite
An Ecosystem for Web Agents: WorkArena, BrowserGym, AgentLab and more
The BrowserGym ecosystem addresses the growing need for efficient evaluation and benchmarking of web agents, particularly those …
Alexandre Lacoste
,
Maxime Gasse
,
Thibault Le Sellier De Chezelles
,
Massimo Caccia
,
Léo Boisvert
,
Megh Thakkar
,
Alexandre Drouin
,
Nicolas Chapados
Montreal AI Symposium (MAIS), 2024.
Cite
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on …
Alexandre Drouin
,
Maxime Gasse
,
Massimo Caccia
,
Issam H. Laradji
,
Manuel Del Verme
,
Tom Marty
,
Léo Boisvert
,
Megh Thakkar
,
Quentin Cappart
,
David Vazquez
,
Nicolas Chapados
,
Alexandre Lacoste
International Conference on Machine Learning (ICML), 2024.
PDF
Cite
Video
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on …
Alexandre Drouin
,
Maxime Gasse
,
Massimo Caccia
,
Issam H. Laradji
,
Manuel Del Verme
,
Tom Marty
,
David Vazquez
,
Nicolas Chapados
,
Alexandre Lacoste
Workshop at the International Conference of Learning Representation (ICLR), 2024.
PDF
Cite
Video
The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study
In this work, we investigate the challenges associated with developing goal-driven AI agents capable of performing open-ended tasks in …
Rim Assouel
,
Tom Marty
,
Massimo Caccia
,
Issam H. Laradji
,
Alexandre Drouin
,
Sai Rajeswar Mudumba
,
Hector Palacios
,
Quentin Cappart
,
David Vazquez
,
Nicolas Chapados
,
Maxime Gasse
,
Alexandre Lacoste
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
Video
«
Cite
×