9

CUA-Suite: Expert Trajectories and Pixel-Precise Grounding for Computer-use Agents

Xiangru Jian, Shravan Nayak, Kevin Qinghong Lin, Aarash Feizi, Kaixin Li, Patrice Béchard, Spandana Gella, Sai Rajeswar Mudumba

Workshop at the International Conference of Machine Learning (ICML), 2026.

Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency

Hierarchical Retrieval at Scale: Bridging Transparency and Efficiency

Information retrieval is a core component of many intelligent systems as it enables conditioning of outputs on new and large-scale …

Shubham Gupta, Zichao Li, Tianyi Chen, Cem Subakan, Siva Reddy, Perouz Taslakian, Valentina Zantedeschi

Workshop at the International Conference of Machine Learning (ICML), 2026.

Overcoming the Modality Gap in Context-Aided Forecasting

Context-aided forecasting (CAF) holds promise for integrating domain knowledge and forward-looking information, enabling AI systems to …

Vincent Zhihao Zheng, Étienne Marcotte, Arjun Ashok, Andrew Williams, Lijun Sun, Alexandre Drouin, Valentina Zantedeschi

Workshop at the International Conference of Machine Learning (ICML), 2026.

Beyond Naïve Prompting: Strategies for Improved Zero-shot Context-aided Forecasting with LLMs

Beyond Naïve Prompting: Strategies for Improved Zero-shot Context-aided Forecasting with LLMs

Forecasting in real-world settings requires models to integrate not only historical data but also relevant contextual information, …

Arjun Ashok, Andrew Williams, Vincent Zhihao Zheng, Irina Rish, Nicolas Chapados, Étienne Marcotte, Valentina Zantedeschi, Alexandre Drouin

Workshop at the Neural Information Processing Systems (NeurIPS), 2025.

GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities

GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities

The rapid evolution of software libraries presents a significant challenge for code generation models, which must adapt to frequent …

Nizar Islah, Justine Gehring, Diganta Misra, Eilif Muller, Irina Rish, Eilif Benjamin Muller, Massimo Caccia

Workshop at the Neural Information Processing Systems (NeurIPS), 2025.

How to Train Your LLM Web Agent: A Statistical Diagnosis

Large language model (LLM) agents for web interfaces have advanced rapidly, yet open-source systems still lag behind proprietary …

Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza, Hadi Nekoei, Thibault Le Sellier De Chezelles, Megh Thakkar, Nicolas Gontier, Miguel Muñoz-Mármol, Sahar Omidi Shayegan, Stefania Raimondo, Xue Steve Liu, Alexandre Drouin, Alexandre Piche, Alexandre Lacoste, Massimo Caccia

Workshop at the Neural Information Processing Systems (NeurIPS), 2025.

Beyond Naïve Prompting: Strategies for Improved Zero-shot Context-aided Forecasting with LLMs

Beyond Naïve Prompting: Strategies for Improved Zero-shot Context-aided Forecasting with LLMs

Forecasting in real-world settings requires models to integrate not only historical data but also relevant contextual information, …

Arjun Ashok, Andrew Williams, Vincent Zhihao Zheng, Irina Rish, Nicolas Chapados, Étienne Marcotte, Valentina Zantedeschi, Alexandre Drouin

Conference on Language Modeling Workshops, 2025.

Using Scaling Laws for Data Source Utility Estimation in Domain-Specific Pre-Training

Using Scaling Laws for Data Source Utility Estimation in Domain-Specific Pre-Training

We introduce a framework for optimizing domain-specific dataset construction in foundation model training. Specifically, we seek a …

Oleksiy Ostapenko, Charles Guille-Escuret, Luke Kumar, Max Tian, Denis Kocetkov, Gopeshh Subbaraj, Raymond Li, Joel Lamy Poirier, Sébastien Paquet, Torsten Scholak

Conference on Language Modeling Workshops, 2025.

AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery

AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery

We introduce AgentAda, the first LLM-powered analytics agent that can learn and use new analytics skills to extract more specialized …

Amirhossein Abaskohi, Amrutha Ramesh, Shailesh Nanisetty, Chirag Goel, David Vazquez, Christopher Pal, Spandana Gella, Giuseppe Carenini, Issam H. Laradji

Workshop at the Annual Meeting of the Association for Computational Linguistics (ACL), 2025.

Adaptive Diffusion Denoised Smoothing : Certified Robustness via Randomized Smoothing with Differentially Private Guided Denoising Diffusion

We propose Adaptive Diffusion Denoised Smoothing, a method for certifying the predictions of a vision model against adversarial …

Frederick Shpilevskiy, Saiyue Lyu, Krishnamurthy (Dj) Dvijotham, Mathias Lécuyer, Pierre-André Noël

Workshop at the International Conference of Machine Learning (ICML), 2025.