9

Contrastive Self-supervision Defines General-Purpose Similarity Functions
Handling out-of-distribution (OOD) and adversarial inputs has become a major stake in the real-world deployment of machine learning …
Exploring the Design Space of Generative Diffusion Processes for Sparse Graphs
We extend score-based generative diffusion processes (GDPs) to sparse graphs and other inherently discrete data, with a focus on …
Implicit Offline Reinforcement Learning via Supervised Learning
Offline Reinforcement Learning (RL) via Supervised Learning is a simple and effective way to learn robotic skills from a dataset of …
Learning Discrete Directed Acyclic Graphs via Backpropagation
Recently continuous relaxations have been proposed in order to learn directed acyclic graphs (DAGs) by backpropagation, instead of …
Using Confounded Data in Offline RL
In this work we consider the problem of confounding in offline RL, also referred to as the delusion problem. While it is known that …
Countering Language Drift with KL Regularization
End-to-end interactive learning of dialogue systems has been all-but-abandoned in favour of other approaches using more labelled data, …
A Planning based Neural-Symbolic Approach for Embodied Instruction Following
The ALFRED environment features an embodied agent following instructions and accomplishing tasks in simulated home environments. …
Continual Learning with self-selecting specialized modules through expansion and pruning
Continual learning (CL) aims to design algorithms that can learn from non-stationarystreams of stationary tasks without forgetting. …
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based …
Explainable, Sensible and Virtuous Workplace Chatbots
We outline three research directions towards the practical implementation of explainable, sensible and virtuous chatbots for the …