9

Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation
The practical utility of causality in decision-making is widely recognized, with causal discovery and inference being inherently …
Causal Discovery with Language Models as Imperfect Experts
Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making. In this work, …
Explaining Graph Neural Networks Using Interpretable Local Surrogates
We propose an interpretable local surrogate (ILS) method for understanding the predictions of black-box graph models. Explainability …
OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning
Imagination is a crucial aspect of human intelligence that enables us to combine concepts in novel ways and make sense of new …
Workflow discovery in low data regimes
Text-based dialogues are now widely used to solve real-world problems. In cases where solution strategies are already known, they can …
Multilingual Code Retrieval Without Paired Data: A New Benchmark and Experiments
We seek to overcome limitations to code retrieval quality posed by the scarcity of data containing pairs of code snippets and natural …
SantaCoder: don't reach for the stars!
The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This …
Leveraging Human Preferences to Master Poetry
Large language models have been fine-tuned to learn poetry via supervised learning on a dataset containing relevant examples. However, …
In-Context Learning for Text Classification with Many Labels
In-context learning (ICL) using large language models for tasks with many labels is challenging due to the limited context window, …
On the Compositional Generalization Gap of In-Context Learning
Pretrained large generative language models have shown great performance on many tasks, but exhibit low compositional generalization …