Reinforcement Learning

Scaling up ML-based Black-box Planning with Partial STRIPS Models

A popular approach for sequential decision-making is to perform simulator-based search guided with Machine Learning (ML) methods like …

Matias Greco, Alvaro Torralba, Jorge Baier, Hector Palacios

ICAPS'22 Workshop on Reliable Data-Driven Planning and Scheduling, 2022.

A Probabilistic Perspective on Reinforcement Learning via Supervised Learning

Reinforcement Learning via Supervised Learning (RvS) only uses supervised techniques to learn desirable behaviors from large datasets. …

Alexandre Piche, Rafael Pardinas, David Vazquez, Christopher Pal

Workshop at the International Conference on Learning Representations (ICLR), 2022.

Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

Inducing causal relationships from observations is a classic problem in machine learning. Most work in causality starts from the …

Nan Rosemary Ke, Aniket Didolkar, Sarthak Mittal, Anirudh Goyal, Guillaume Lajoie, Danilo Rezende, Yoshua Bengio, Christopher Pal, Stefan Bauer , Michael C. Mozer

Conference on Neural Information Processing Systems (NeurIPS), 2021.

Reinforcement Learning with Random Delays

Action and observation delays commonly occur in many Reinforcement Learning applications, such as remote control scenarios. We study …

Simon Ramstedt, Yann Bouteiller, Giovanni Beltrame, Christopher Pal, Jonathan Binas

International Conference on Learning Representations (ICLR), 2021.

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal …

Quentin Cappart, Thierry Moisan, Louis-Martin Rousseau, Isabeau Prémont-Schwarz, Andre Cire

Association for the Advancement of Artificial Intelligence (AAAI), 2021.

Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint …

Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

Conference on Neural Information Processing Systems (NeurIPS), 2020.

Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents

As deep reinforcement learning driven by visual perception becomes more widely used there is a growing need to better understand and …

Christian Rupprecht, Cyril Ibrahim, Christopher Pal

International Conference on Learning Representations (ICLR), 2020.

Reinforced Active Learning for Image Segmentation

Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and …

Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher Pal

International Conference on Learning Representations (ICLR), 2020.

Learning Reward Machines for Partially Observable Reinforcement Learning

Reward Machines (RMs) provide a structured, automata-based representation of a reward function that enables a Reinforcement Learning …

Rodrigo Toro Icarte, Ethan Waldie, Toryn Q. Klassen, Richard Valenzano, Margarita P. Castro, Sheila A. McIlraith

Conference on Neural Information Processing Systems (NeurIPS), 2019.

Real-Time Reinforcement Learning

Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used …

Simon Ramstedt, Christopher Pal

Conference on Neural Information Processing Systems (NeurIPS), 2019.