Publications
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation.
Rabiul Awal,
Mahsa Massoud,
Zichao Li,
Aarash Feizi,
Suyuchen Wang,
Christopher Pal,
Aishwarya Agrawal,
David Vazquez,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
Conference on Empirical Methods in Natural Language Processing (EMNLP),
2025.
BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning.
Ahmed Masry,
Abhay Puri,
Masoud Hashemi,
Juan A. Rodriguez,
Megh Thakkar,
Khyati Mahajan,
Vikas Yadav,
Sathwik Tejaswi Madhusudhan,
Alexandre Piche,
Dzmitry Bahdanau,
Christopher Pal,
David Vazquez,
Enamul Hoque Prince ,
Perouz Taslakian,
Sai Rajeswar Mudumba,
Spandana Gella. At
Conference on Language Modeling (COLM),
2025.
Silent Sabotage: Injecting Backdoors into AI Agents Through Fine-Tuning.
Léo Boisvert,
Abhay Puri,
Chandra Kiran Reddy Evuru,
Joshua Kazdan,
Avinandan Bose,
Quentin Cappart,
Maryam Fazel,
Sai Rajeswar Mudumba,
Jason Stanley,
Nicolas Chapados,
Alexandre Drouin,
Krishnamurthy (Dj) Dvijotham. At
Workshop at the International Conference of Machine Learning (ICML),
2025.
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction.
Shravan Nayak,
Xiangru Jian,
Kevin Lin,
Juan A. Rodriguez,
Motek Kalsi,
Nicolas Chapados,
Tamer Özsu,
Aishwarya Agrawal,
David Vazquez,
Christopher Pal,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
International Conference on Machine Learning (ICML),
2025.
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation.
Rabiul Awal,
Mahsa Massoud,
Zichao Li,
Aarash Feizi,
Suyuchen Wang,
Christopher Pal,
Aishwarya Agrawal,
David Vazquez,
Siva Reddy,
Juan A. Rodriguez,
Perouz Taslakian,
Sai Rajeswar Mudumba. At
Workshop at the Computer Vision and Pattern Recognition Conference (CVPR),
2025.
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding.
Ahmed Masry,
Juan A. Rodriguez,
Tianyu Zhang,
Suyuchen Wang,
Chao Wang,
Aarash Feizi,
Akshay Kalkunte,
Abhay Puri,
Xiangru Jian,
Pierre-André Noël,
Sathwik Madhusudhan,
Marco Pedersoli,
Bang Liu,
Nicolas Chapados,
Yoshua Bengio,
Enamul Hoque Prince ,
Christopher Pal,
Issam H. Laradji,
David Vazquez,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
Workshop at the International Conference of Learning Representation (ICLR),
2025.
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation.
Rabiul Awal,
Mahsa Massoud,
Zichao Li,
Aarash Feizi,
Suyuchen Wang,
Christopher Pal,
Aishwarya Agrawal,
David Vazquez,
Siva Reddy,
Juan A. Rodriguez,
Perouz Taslakian,
Spandana Gella,
Sai Rajeswar Mudumba. At
Workshop at the International Conference of Learning Representation (ICLR),
2025.
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks.
Juan A. Rodriguez,
Xiangru Jian,
Siba Smarak Panigrahi,
Tianyu Zhang,
Aarash Feizi,
Abhay Puri,
Akshay Kalkunte,
Francois Savard,
Ahmed Masry,
Shravan Nayak,
Rabiul Awal,
Mahsa Massoud,
Amirhossein Abaskohi,
Zichao Li,
Suyuchen Wang,
Pierre-André Noël,
Mats L. Richter,
Saverio Vadacchino,
Shubham Agarwal,
Sanket Biswas,
Sara Shanian,
Ying Zhang,
Sathwik Tejaswi Madhusudhan,
João Monteiro,
Krishnamurthy (Dj) Dvijotham,
Torsten Scholak,
Nicolas Chapados,
Sepideh Kharaghani,
Sean Hughes,
Tamer Özsu,
Siva Reddy,
Marco Pedersoli,
Yoshua Bengio,
Christopher Pal,
Issam H. Laradji,
Spandana Gella,
Perouz Taslakian,
David Vazquez,
Sai Rajeswar Mudumba. At
International Conference of Learning Representations (ICLR),
2025.
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation.
Gaurav Sahu,
Abhay Puri,
Juan A. Rodriguez,
Amirhossein Abaskohi,
Mohammad (Aaron) Chegini ,
Alexandre Drouin,
Perouz Taslakian,
Valentina Zantedeschi,
Alexandre Lacoste,
David Vazquez,
Nicolas Chapados,
Christopher Pal,
Sai Rajeswar Mudumba,
Issam H. Laradji. At
International Conference of Learning Representations (ICLR),
2025.
BigDocs: A Permissively-Licensed Dataset for Training Vision-Language Models on Document and Code Tasks.
Juan A. Rodriguez,
Xiangru Jian,
Siba Smarak Panigrahi,
Tianyu Zhang,
Aarash Feizi,
Abhay Puri,
Akshay Kalkunte,
Francois Savard,
Amirhossein Abaskohi,
Ahmed Masry,
Shravan Nayak,
Mahsa Massoud,
Rabiul Awal,
Pierre-André Noël,
Mats L. Richter,
Saverio Vadacchino,
Shubham Agarwal,
Sanket Biswas,
Ying Zhang,
Sathwik Tejaswi Madhusudhan,
João Monteiro,
Krishnamurthy (Dj) Dvijotham,
Torsten Scholak,
Nicolas Chapados,
Sean Hughes,
Tamer Özsu,
Aishwarya Agrawal,
Marco Pedersoli,
Christopher Pal,
Perouz Taslakian,
David Vazquez,
Issam H. Laradji,
Spandana Gella,
Sai Rajeswar Mudumba. At
Workshop at the Neural Information Processing Systems (NeurIPS),
2024.
The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study.
Rim Assouel,
Tom Marty,
Massimo Caccia,
Issam H. Laradji,
Alexandre Drouin,
Sai Rajeswar Mudumba,
Hector Palacios,
Quentin Cappart,
David Vazquez,
Nicolas Chapados,
Maxime Gasse,
Alexandre Lacoste. At
Workshop at the Neural Information Processing Systems (NeurIPS),
2023.