Accueil
Équipe
Publications
Open Source
Démos
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow recherche
Tags
Reinforcement Learning
ServiceNow recherche
Reinforcement Learning
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal …
Quentin Cappart
,
Thierry Moisan
,
Louis-Martin Rousseau
,
Isabeau Prémont-Schwarz
,
Andre Cire
Association for the Advancement of Artificial Intelligence (AAAI), 2021.
PDF
Citation
Code
Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning
In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint …
Julien Roy
,
Paul Barde
,
Félix G. Harvey
,
Derek Nowrouzezahrai
,
Christopher Pal
Conference on Neural Information Processing Systems (NeurIPS), 2020.
PDF
Citation
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
As deep reinforcement learning driven by visual perception becomes more widely used there is a growing need to better understand and …
Christian Rupprecht
,
Cyril Ibrahim
,
Christopher Pal
International Conference on Learning Representations (ICLR), 2020.
PDF
Citation
Reinforced Active Learning for Image Segmentation
Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and …
Arantxa Casanova
,
Pedro O. Pinheiro
,
Negar Rostamzadeh
,
Christopher Pal
International Conference on Learning Representations (ICLR), 2020.
PDF
Citation
Code
Learning Reward Machines for Partially Observable Reinforcement Learning
Reward Machines (RMs) provide a structured, automata-based representation of a reward function that enables a Reinforcement Learning …
Rodrigo Toro Icarte
,
Ethan Waldie
,
Toryn Q. Klassen
,
Richard Valenzano
,
Margarita P. Castro
,
Sheila A. McIlraith
Conference on Neural Information Processing Systems (NeurIPS), 2019.
PDF
Citation
Real-Time Reinforcement Learning
Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used …
Simon Ramstedt
,
Christopher Pal
Conference on Neural Information Processing Systems (NeurIPS), 2019.
PDF
Citation
Code
Searching for Markovian Subproblems to Address Partially Observable Reinforcement Learning
In partially observable environments, an agent’s policy should often be a function of the history of its interaction with the …
Rodrigo Toro Icarte
,
Ethan Waldie
,
Toryn Q. Klassen
,
Richard Valenzano
,
Margarita P. Castro
,
Sheila A. McIlraith
Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2019.
PDF
Citation
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and …
Maxime Chevalier-Boisvert
,
Dzmitry Bahdanau
,
Salem Lahlou
,
Lucas Willems
,
Chitwan Saharia
,
Thien Huu Nguyen
,
Yoshua Bengio
International Conference on Learning Representations (ICLR), 2019.
PDF
Citation
Code
Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning
Finding tight bounds on the optimal solution is a critical element of practical solution methods for discrete optimization problems. In …
Quentin Cappart
,
Emmanuel Goutierre
,
David Bergman
,
Louis-Martin Rousseau
Association for the Advancement of Artificial Intelligence (AAAI), 2019.
PDF
Citation
Code
Reinforced Imitation in Heterogeneous Action Space
Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we …
Konrad Zolna
,
Negar Rostamzadeh
,
Yoshua Bengio
,
Sungjin Ahn
,
Pedro O. Pinheiro
Workshop at the Neural Information Processing Systems (NeurIPS), 2018.
PDF
Citation
«
»
Citation
×