Explainability

Language model activations entangle concepts that mediate their behavior, making it difficult to interpret these factors, which has …

Neural Information Processing Systems (NeurIPS), 2025.

Explainability and transparency of AI systems are undeniably important, leading to several research studies and tools addressing them. …

Conference on Human Factors in Computing Systems (ACM-CHI), 2024.

We propose an interpretable local surrogate (ILS) method for understanding the predictions of black-box graph models. Explainability …

Workshop at the International Conference on Machine Learning (ICML), 2023.

We outline three research directions towards the practical implementation of explainable, sensible and virtuous chatbots for the …

Gabriel Huang, Valérie Bécaert, David Vazquez

Montreal AI Symposium (MAIS), 2022.

Black-box machine learning (ML) models have become increasingly popular in practice. They can offer great performance, especially in …

Marc-Etienne Brunet, Masoud Hashemi

Montreal AI Symposium (MAIS), 2022.

Explainability for machine learning models has gained considerable attention within the research community given the importance of …

International Conference on Computer Vision (ICCV), 2021.

In this work, we focus on the use of influence functions to identify relevant training examples that one might hope …

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020.