Publications
DRBench: A Realistic Benchmark for Enterprise Deep Research.
Amirhossein Abaskohi,
Tianyi Chen,
Miguel Muñoz-Mármol,
Curtis Fox,
Amrutha Ramesh,
Étienne Marcotte,
Xing Han Lu,
Nicolas Chapados,
Spandana Gella,
Christopher Pal,
Alexandre Drouin,
Issam H. Laradji. At
International Conference on Learning Representations,
2026.
Grounding Computer Use Agents on Human Demonstrations.
Aarash Feizi,
Shravan Nayak,
Xiangru Jian,
Kevin Qinghong Lin,
Kaixin Li,
Rabiul Awal,
Xing Han Lu,
Johan Obando,
Juan A. Rodriguez,
Nicolas Chapados,
David Vazquez,
Adriana Romero Soriano,
Reihaneh Rabbany,
Perouz Taslakian,
Christopher Pal,
Spandana Gella,
Sai Rajeswar Mudumba. At
International Conference on Learning Representations,
2026.
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Document Understanding.
Ahmed Masry,
Juan A. Rodriguez,
Tianyu Zhang,
Suyuchen Wang,
Chao Wang,
Aarash Feizi,
Akshay Kalkunte,
Abhay Puri,
Xiangru Jian,
Pierre-André Noël,
Sathwik Madhusudhan,
Marco Pedersoli,
Bang Liu,
Nicolas Chapados,
Yoshua Bengio,
Enamul Hoque Prince,
Christopher Pal,
Issam H. Laradji,
David Vazquez,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
Neural Information Processing Systems (NeurIPS),
2025.
EDRA: Enterprise Deep Research Agent.
Jordan Prince Tremblay,
Chris Tyler,
Daniel Tremblay,
Oleh Shliazhko,
Issam H. Laradji,
Christopher Pal,
Gabriel Huang,
Gaurav Sahu,
Miguel Muñoz-Mármol,
Tianyi Chen,
Sébastien Paquet,
Alexandre Drouin,
Krishnamurthy (Dj) Dvijotham,
Jason Stanley,
Nicolas Chapados. At
NOW AI,
2025.
StarUI: Learning to Ground Agentic Perception in Desktop GUIs.
Aarash Feizi,
Shravan Nayak,
Kevin Qinghong Lin,
Kaixin Li,
Rabiul Awal,
Xiangru Jian,
Juan A. Rodriguez,
Nicolas Chapados,
David Vazquez,
Reihaneh Rabbany,
Adriana Romero Soriano,
Perouz Taslakian,
Christopher Pal,
Spandana Gella,
Sai Rajeswar Mudumba. At
NOW AI,
2025.
Silent Sabotage: Injecting Backdoors into AI Agents Through Fine-Tuning.
Léo Boisvert,
Abhay Puri,
Chandra Kiran Reddy Evuru,
Joshua Kazdan,
Avinandan Bose,
Quentin Cappart,
Maryam Fazel,
Sai Rajeswar Mudumba,
Jason Stanley,
Nicolas Chapados,
Alexandre Drouin,
Krishnamurthy (Dj) Dvijotham. At
Workshop at the International Conference on Machine Learning (ICML),
2025.
Context is Key: A Benchmark for Forecasting with Essential Textual Information.
Andrew Williams,
Arjun Ashok,
Étienne Marcotte,
Valentina Zantedeschi,
Jithendaraa Subramanian,
Roland Riachi,
James Requeima,
Alexandre Lacoste,
Irina Rish,
Nicolas Chapados,
Alexandre Drouin. At
International Conference on Machine Learning (ICML),
2025.
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction.
Shravan Nayak,
Xiangru Jian,
Kevin Lin,
Juan A. Rodriguez,
Motek Kalsi,
Nicolas Chapados,
Tamer Özsu,
Aishwarya Agrawal,
David Vazquez,
Christopher Pal,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
International Conference on Machine Learning (ICML),
2025.
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding.
Ahmed Masry,
Juan A. Rodriguez,
Tianyu Zhang,
Suyuchen Wang,
Chao Wang,
Aarash Feizi,
Akshay Kalkunte,
Abhay Puri,
Xiangru Jian,
Pierre-André Noël,
Sathwik Madhusudhan,
Marco Pedersoli,
Bang Liu,
Nicolas Chapados,
Yoshua Bengio,
Enamul Hoque Prince,
Christopher Pal,
Issam H. Laradji,
David Vazquez,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
Workshop at the International Conference on Learning Representations (ICLR),
2025.
Societal Alignment Frameworks Can Improve LLM Alignment.
Karolina Stanczak,
Nicholas Meade,
Mehar Bhatia,
Hattie Zhou,
Konstantin Böttinger,
Jeremy Barns,
Jason Stanley,
Nicolas Papernot,
Nicolas Chapados,
Denis Therien,
Timothy P Lillicrap,
Ana Marasovic,
Sylvie Delacroix,
Gillian K Hadfield,
Siva Reddy. At
Workshop at the International Conference on Learning Representations (ICLR),
2025.
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks.
Juan A. Rodriguez,
Xiangru Jian,
Siba Smarak Panigrahi,
Tianyu Zhang,
Aarash Feizi,
Abhay Puri,
Akshay Kalkunte,
Francois Savard,
Ahmed Masry,
Shravan Nayak,
Rabiul Awal,
Mahsa Massoud,
Amirhossein Abaskohi,
Zichao Li,
Suyuchen Wang,
Pierre-André Noël,
Mats L. Richter,
Saverio Vadacchino,
Shubham Agarwal,
Sanket Biswas,
Sara Shanian,
Ying Zhang,
Sathwik Tejaswi Madhusudhan,
João Monteiro,
Krishnamurthy (Dj) Dvijotham,
Torsten Scholak,
Nicolas Chapados,
Sepideh Kharaghani,
Sean Hughes,
Tamer Özsu,
Siva Reddy,
Marco Pedersoli,
Yoshua Bengio,
Christopher Pal,
Issam H. Laradji,
Spandana Gella,
Perouz Taslakian,
David Vazquez,
Sai Rajeswar Mudumba. At
International Conference on Learning Representations (ICLR),
2025.
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation.
Gaurav Sahu,
Abhay Puri,
Juan A. Rodriguez,
Amirhossein Abaskohi,
Mohammad (Aaron) Chegini,
Alexandre Drouin,
Perouz Taslakian,
Valentina Zantedeschi,
Alexandre Lacoste,
David Vazquez,
Nicolas Chapados,
Christopher Pal,
Sai Rajeswar Mudumba,
Issam H. Laradji. At
International Conference on Learning Representations (ICLR),
2025.
The BrowserGym Ecosystem for Web Agent Research.
Thibault Le Sellier De Chezelles,
Maxime Gasse,
Alexandre Drouin,
Massimo Caccia,
Léo Boisvert,
Megh Thakkar,
Tom Marty,
Rim Assouel,
Sahar Omidi Shayegan,
Siva Reddy,
Quentin Cappart,
Graham Neubig,
Nicolas Chapados,
Alexandre Lacoste. At
Transactions on Machine Learning Research (TMLR),
2025.
BigDocs: A Permissively-Licensed Dataset for Training Vision-Language Models on Document and Code Tasks.
Juan A. Rodriguez,
Xiangru Jian,
Siba Smarak Panigrahi,
Tianyu Zhang,
Aarash Feizi,
Abhay Puri,
Akshay Kalkunte,
Francois Savard,
Amirhossein Abaskohi,
Ahmed Masry,
Shravan Nayak,
Mahsa Massoud,
Rabiul Awal,
Pierre-André Noël,
Mats L. Richter,
Saverio Vadacchino,
Shubham Agarwal,
Sanket Biswas,
Ying Zhang,
Sathwik Tejaswi Madhusudhan,
João Monteiro,
Krishnamurthy (Dj) Dvijotham,
Torsten Scholak,
Nicolas Chapados,
Sean Hughes,
Tamer Özsu,
Aishwarya Agrawal,
Marco Pedersoli,
Christopher Pal,
Perouz Taslakian,
David Vazquez,
Issam H. Laradji,
Spandana Gella,
Sai Rajeswar Mudumba. At
Workshop at Neural Information Processing Systems (NeurIPS),
2024.
Context is Key: A Benchmark for Forecasting with Essential Textual Information.
Andrew Williams,
Arjun Ashok,
Étienne Marcotte,
Valentina Zantedeschi,
Jithendaraa Subramanian,
Roland Riachi,
James Requeima,
Alexandre Lacoste,
Irina Rish,
Nicolas Chapados,
Alexandre Drouin. At
Workshop at Neural Information Processing Systems (NeurIPS),
2024.
Fine-Tuning Web Agents: It Works, But It's Trickier Than You Think.
Massimo Caccia,
Megh Thakkar,
Léo Boisvert,
Thibault Le Sellier De Chezelles,
Alexandre Piche,
Nicolas Chapados,
Alexandre Drouin,
Maxime Gasse,
Alexandre Lacoste. At
Workshop at Neural Information Processing Systems (NeurIPS),
2024.
Context is Key: A Benchmark for Forecasting with Essential Textual Information.
Andrew Williams,
Arjun Ashok,
Étienne Marcotte,
Valentina Zantedeschi,
Jithendaraa Subramanian,
Roland Riachi,
James Requeima,
Alexandre Lacoste,
Irina Rish,
Nicolas Chapados,
Alexandre Drouin. At
Foundation Models for Time Series,
2024.
Fine-Tuning Web Agents: It Works, But It's Trickier Than You Think.
Massimo Caccia,
Megh Thakkar,
Léo Boisvert,
Thibault Le Sellier De Chezelles,
Alexandre Piche,
Nicolas Chapados,
Alexandre Drouin,
Maxime Gasse,
Alexandre Lacoste. At
NOW AI Conference (NOWAI),
2024.
An Ecosystem for Web Agents: WorkArena, BrowserGym, AgentLab and more.
Alexandre Lacoste,
Maxime Gasse,
Thibault Le Sellier De Chezelles,
Massimo Caccia,
Léo Boisvert,
Megh Thakkar,
Alexandre Drouin,
Nicolas Chapados. At
Montreal AI Symposium (MAIS),
2024.
Context is Key: A Benchmark for Forecasting with Essential Textual Information.
Andrew Williams,
Arjun Ashok,
Étienne Marcotte,
Valentina Zantedeschi,
Jithendaraa Subramanian,
Roland Riachi,
James Requeima,
Alexandre Lacoste,
Irina Rish,
Nicolas Chapados,
Alexandre Drouin. At
Montreal AI Symposium (MAIS),
2024.
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
Alexandre Drouin,
Maxime Gasse,
Massimo Caccia,
Issam H. Laradji,
Manuel Del Verme,
Tom Marty,
Léo Boisvert,
Megh Thakkar,
Quentin Cappart,
David Vazquez,
Nicolas Chapados,
Alexandre Lacoste. At
International Conference on Machine Learning (ICML),
2024.
StarCoder 2 and The Stack v2: The Next Generation.
Anton Lozhkov,
Raymond Li,
Loubna Ben Allal,
Federico Cassano,
Joel Lamy Poirier,
Nouamane Tazi,
Ao Tang,
Dmytro Pykhtar,
Jiawei Liu,
Yuxiang Wei,
Tianyang Liu,
Max Tian,
Denis Kocetkov,
Arthur Zucker,
Younes Belkada,
Zijian Wang,
Dmitry Abulkhanov,
Indraneil Paul,
Zhuang Li,
Wen-Ding Li,
Megan Risdal,
Jia Li,
Terry Yue Zhuo,
Nii Osae Osae Dade,
Lucas Krauß,
Naman Jain,
Yixuan Su,
Xuanli He,
Edoardo Abati,
Yekun Chai,
Xiangru Tang,
Christopher Akiki,
Chenghao Mou,
Binyuan Hui,
Nicolas Patry,
Canwen Xu,
Julian McAuley,
Han Hu,
Torsten Scholak,
Sébastien Paquet,
Jennifer Robinson,
Carolyn Jane Anderson,
Nicolas Chapados,
Mostofa Patwary,
Nima Tajbakhsh,
Yacine Jernite,
Carlos Muñoz Ferrandis,
Lingming Zhang,
Sean Hughes,
Thomas Wolf,
Arjun Guha,
Leandro von Werra,
Harm de Vries,
Alex Gu,
Armel Zebaze,
Evgenii Zheltonozhskii,
Jian Zhu,
Manan Dey,
Marc Marone,
Mayank Mishra,
Muhtasham Oblokulov,
Olivier Dehaene,
Qian Liu,
Tri Dao,
Wenhao Yu,
Niklas Muennighoff. At
arXiv,
2024.
Lag-Llama: A Foundation Model for Probabilistic Time Series Forecasting.
Kashif Rasul,
Arjun Ashok,
Marin Bilos,
Andrew Williams,
Arian Khorasani,
George Adamopoulos,
Rishika Bhagwatkar,
Hena Ghonia,
Nadhir Hassen,
Anderson Schneider,
Sahil Garg,
Alexandre Drouin,
Nicolas Chapados,
Yuriy Nevmyvaka,
Irina Rish. At
Workshop at Neural Information Processing Systems (NeurIPS),
2023.
The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study.
Rim Assouel,
Tom Marty,
Massimo Caccia,
Issam H. Laradji,
Alexandre Drouin,
Sai Rajeswar Mudumba,
Hector Palacios,
Quentin Cappart,
David Vazquez,
Nicolas Chapados,
Maxime Gasse,
Alexandre Lacoste. At
Workshop at Neural Information Processing Systems (NeurIPS),
2023.