9

DoomArena: A framework for Testing AI Agents Against Evolving Security Threats

DoomArena: A framework for Testing AI Agents Against Evolving Security Threats

We present DoomArena, a security evaluation framework for AI agents. DoomArena is designed on three principles: 1) It is a …

Léo Boisvert, Abhay Puri, Gabriel Huang, Mihir Bansal, Chandra Kiran Reddy Evuru, Avinandan Bose, Quentin Cappart, Maryam Fazel, Alexandre Lacoste, Alexandre Drouin, Jason Stanley, Krishnamurthy (Dj) Dvijotham

Workshop at the International Conference of Machine Learning (ICML), 2025.

How to Train Your LLM Web Agent: A Statistical Diagnosis (Oral)

Large language model (LLM) agents for web interfaces have advanced rapidly, yet open-source systems still lag behind proprietary …

Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, Hadi Nekoei, Thibault Le Sellier De Chezelles, Megh Thakkar, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Steve Liu, Alexandre Drouin, Alexandre Piche, Alexandre Lacoste, Massimo Caccia

Workshop at the International Conference of Machine Learning (ICML), 2025.

Silent Sabotage: Injecting Backdoors into AI Agents Through Fine-Tuning

Silent Sabotage: Injecting Backdoors into AI Agents Through Fine-Tuning

The rise of AI agents that can use tools, browse the web and interact with computers on behalf of a user, has sparked strong interest …

Léo Boisvert, Abhay Puri, Chandra Kiran Reddy Evuru, Joshua Kazdan, Avinandan Bose, Quentin Cappart, Maryam Fazel, Sai Rajeswar Mudumba, Jason Stanley, Nicolas Chapados, Alexandre Drouin, Krishnamurthy (Dj) Dvijotham

Workshop at the International Conference of Machine Learning (ICML), 2025.

WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation

Understanding diverse web data and automating web development presents an exciting challenge for agentic AI. While existing benchmarks …

Rabiul Awal, Mahsa Massoud, Zichao Li, Aarash Feizi, Suyuchen Wang, Christopher Pal, Aishwarya Agrawal, David Vazquez, Siva Reddy, Juan A. Rodriguez, Perouz Taslakian, Sai Rajeswar Mudumba

Workshop at the Computer Vision and Pattern Recognition Conference (CVPR), 2025.

Backpropagating from Customer Success

Backpropagating from Customer Success

How do we measure the real performance of AI in enterprise—beyond just model performance? This work introduces a research project …

Midam Kim, Fabio Casati, Darrell Penta, Ihnaee Choi, Minyoung Kim

Conference on Human Factors in Computing Systems (ACM-CHI), 2025.

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Aligning visual features with language embeddings is a key challenge in vision-language models (VLMs). The performance of such models …

Ahmed Masry, Juan A. Rodriguez, Tianyu Zhang, Suyuchen Wang, Chao Wang, Aarash Feizi, Akshay Kalkunte, Abhay Puri, Xiangru Jian, Pierre-André Noël, Sathwik Madhusudhan, Marco Pedersoli, Bang Liu, Nicolas Chapados, Yoshua Bengio, Enamul Hoque Prince , Christopher Pal, Issam H. Laradji, David Vazquez, Perouz Taslakian, Spandana Gella, Sai Rajeswar Mudumba

Workshop at the International Conference of Learning Representation (ICLR), 2025.

Learning to Defer for Causal Discovery with Imperfect Experts

Learning to Defer for Causal Discovery with Imperfect Experts

Integrating expert knowledge, e.g. from large language models, into causal discovery algorithms can be challenging when the knowledge …

Oscar Clivio, Divyat Mahajan, Perouz Taslakian, Sara Magliacane, Ioannis Mitliagkas, Valentina Zantedeschi, Alexandre Drouin

Workshop at the International Conference of Learning Representation (ICLR), 2025.

No, of course I can! Refusal Mechanisms Can Be Exploited Using Harmless Fine-Tuning Data

Leading language model (LM) providers like OpenAI and Google offer fine-tuning APIs that allow customers to adapt LMs for specific use …

Joshua Kazdan, Krishnamurthy (Dj) Dvijotham, Sanmi Koyejo

Workshop at the International Conference of Learning Representation (ICLR), 2025.

Societal Alignment Frameworks Can Improve LLM Alignment

Recent progress in large language models (LLMs) has focused on producing responses that meet human expectations and align with shared …

Karolina Stanczak, Nicholas Meade, Mehar Bhatia, Hattie Zhou, Konstantin Böttinger, Jeremy Barns, Jason Stanley, Nicolas Papernot, Nicolas Chapados, Denis Therien, Timothy P Lillicrap, Ana Marasovic, Sylvie Delacroix, Gillian K Hadfield, Siva Reddy

Workshop at the International Conference of Learning Representation (ICLR), 2025.

The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications

The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications

Causal discovery aims to automatically uncover causal relationships from data, a capability with significant potential across many …

Philippe Brouillard, Chandler Squires, Jonas Wahl, Konrad P. Kording, Karen Sachs, Alexandre Drouin, Dhanya Sridhar

Workshop at the International Conference of Learning Representation (ICLR), 2025.