9

Using Confounded Data in Offline RL
In this work we consider the problem of confounding in offline RL, also referred to as the delusion problem. While it is known that …
Countering Language Drift with KL Regularization
End-to-end interactive learning of dialogue systems has been all-but-abandoned in favour of other approaches using more labelled data, …
A Planning based Neural-Symbolic Approach for Embodied Instruction Following
The ALFRED environment features an embodied agent following instructions and accomplishing tasks in simulated home environments. …
Continual Learning with self-selecting specialized modules through expansion and pruning
Continual learning (CL) aims to design algorithms that can learn from non-stationarystreams of stationary tasks without forgetting. …
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
Data augmentation is a widely employed technique to alleviate the problem of data scarcity. In this work, we propose a prompting-based …
Explainable, Sensible and Virtuous Workplace Chatbots
We outline three research directions towards the practical implementation of explainable, sensible and virtuous chatbots for the …
Explaining by Example: A Practitioner’s Perspective
Black-box machine learning (ML) models have become increasingly popular in practice. They can offer great performance, especially in …
Leveraging Activation Patterns to Define Classifiers Able to Detect and Reject Anomalies
In this work, we introduce models that perform comparably with state-of-the-art alternatives in terms of prediction accuracy while …
S-LLM: Semi-Supervised Large Language Model for Chat Summarization
As producing high-quality summaries of chat dialogues currently requires large labeled datasets, we propose a method to efficiently …
TACTiS: Transformer-Attentional Copulas for Time Series
The estimation of time-varying quantities is a fundamental component of decision making in fields such as healthcare and finance. …