Web Agents

Fine-Tuning Web Agents: It Works, But It's Trickier Than You Think

Recent advancements in large language models (LLMs) have sparked interest in developing autonomous web agents capable of performing …

Massimo Caccia, Megh Thakkar, Léo Boisvert, Thibault Le Sellier De Chezelles, Alexandre Piche, Nicolas Chapados, Alexandre Drouin, Maxime Gasse, Alexandre Lacoste

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

The ability of large language models (LLMs) to mimic human-like intelligence has led to a surge in LLM-based autonomous agents. Though …

Léo Boisvert, Megh Thakkar, Maxime Gasse, Massimo Caccia, Thibault Le Sellier De Chezelles, Quentin Cappart, Nicolas Chapados, Alexandre Lacoste, Alexandre Drouin

NeurIPS Datasets and Benchmarks Track (NeurIPS Datasets), 2024.

An Ecosystem for Web Agents: WorkArena, BrowserGym, AgentLab and more

The BrowserGym ecosystem addresses the growing need for efficient evaluation and benchmarking of web agents, particularly those …

Alexandre Lacoste, Maxime Gasse, Thibault Le Sellier De Chezelles, Massimo Caccia, Léo Boisvert, Megh Thakkar, Alexandre Drouin, Nicolas Chapados

Montreal AI Symposium (MAIS), 2024.

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on …

Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, Léo Boisvert, Megh Thakkar, Quentin Cappart, David Vazquez, Nicolas Chapados, Alexandre Lacoste

International Conference on Machine Learning (ICML), 2024.

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on …

Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, David Vazquez, Nicolas Chapados, Alexandre Lacoste

Workshop at the International Conference of Learning Representation (ICLR), 2024.

The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study

In this work, we investigate the challenges associated with developing goal-driven AI agents capable of performing open-ended tasks in …

Rim Assouel, Tom Marty, Massimo Caccia, Issam H. Laradji, Alexandre Drouin, Sai Rajeswar Mudumba, Hector Palacios, Quentin Cappart, David Vazquez, Nicolas Chapados, Maxime Gasse, Alexandre Lacoste

Workshop at the Neural Information Processing Systems (NeurIPS), 2023.