About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Tags
Reinforcement Learning
ServiceNow AI Research
Reinforcement Learning
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal …
Quentin Cappart
,
Thierry Moisan
,
Louis-Martin Rousseau
,
Isabeau Prémont-Schwarz
,
Andre Cire
Association for the Advancement of Artificial Intelligence (AAAI), 2021.
PDF
Cite
Code
Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning
In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint …
Julien Roy
,
Paul Barde
,
Félix G. Harvey
,
Derek Nowrouzezahrai
,
Christopher Pal
Conference on Neural Information Processing Systems (NeurIPS), 2020.
PDF
Cite
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
As deep reinforcement learning driven by visual perception becomes more widely used there is a growing need to better understand and …
Christian Rupprecht
,
Cyril Ibrahim
,
Christopher Pal
International Conference on Learning Representations (ICLR), 2020.
PDF
Cite
Reinforced Active Learning for Image Segmentation
Learning-based approaches for semantic segmentation have two inherent challenges. First, acquiring pixel-wise labels is expensive and …
Arantxa Casanova
,
Pedro O. Pinheiro
,
Negar Rostamzadeh
,
Christopher Pal
International Conference on Learning Representations (ICLR), 2020.
PDF
Cite
Code
Learning Reward Machines for Partially Observable Reinforcement Learning
Reward Machines (RMs) provide a structured, automata-based representation of a reward function that enables a Reinforcement Learning …
Rodrigo Toro Icarte
,
Ethan Waldie
,
Toryn Q. Klassen
,
Richard Valenzano
,
Margarita P. Castro
,
Sheila A. McIlraith
Conference on Neural Information Processing Systems (NeurIPS), 2019.
PDF
Cite
Real-Time Reinforcement Learning
Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used …
Simon Ramstedt
,
Christopher Pal
Conference on Neural Information Processing Systems (NeurIPS), 2019.
PDF
Cite
Code
Searching for Markovian Subproblems to Address Partially Observable Reinforcement Learning
In partially observable environments, an agent’s policy should often be a function of the history of its interaction with the …
Rodrigo Toro Icarte
,
Ethan Waldie
,
Toryn Q. Klassen
,
Richard Valenzano
,
Margarita P. Castro
,
Sheila A. McIlraith
Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2019.
PDF
Cite
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and …
Maxime Chevalier-Boisvert
,
Dzmitry Bahdanau
,
Salem Lahlou
,
Lucas Willems
,
Chitwan Saharia
,
Thien Huu Nguyen
,
Yoshua Bengio
International Conference on Learning Representations (ICLR), 2019.
PDF
Cite
Code
Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning
Finding tight bounds on the optimal solution is a critical element of practical solution methods for discrete optimization problems. In …
Quentin Cappart
,
Emmanuel Goutierre
,
David Bergman
,
Louis-Martin Rousseau
Association for the Advancement of Artificial Intelligence (AAAI), 2019.
PDF
Cite
Code
Reinforced Imitation in Heterogeneous Action Space
Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we …
Konrad Zolna
,
Negar Rostamzadeh
,
Yoshua Bengio
,
Sungjin Ahn
,
Pedro O. Pinheiro
Workshop at the Neural Information Processing Systems (NeurIPS), 2018.
PDF
Cite
«
»
Cite
×