About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Tags
Reinforcement Learning
ServiceNow AI Research
Reinforcement Learning
Searching for Markovian Subproblems to Address Partially Observable Reinforcement Learning
In partially observable environments, an agent’s policy should often be a function of the history of its interaction with the …
Rodrigo Toro Icarte
,
Ethan Waldie
,
Toryn Q. Klassen
,
Richard Valenzano
,
Margarita P. Castro
,
Sheila A. McIlraith
Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2019.
PDF
Cite
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning
Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and …
Maxime Chevalier-Boisvert
,
Dzmitry Bahdanau
,
Salem Lahlou
,
Lucas Willems
,
Chitwan Saharia
,
Thien Huu Nguyen
,
Yoshua Bengio
International Conference on Learning Representations (ICLR), 2019.
PDF
Cite
Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning
Finding tight bounds on the optimal solution is a critical element of practical solution methods for discrete optimization problems. In …
Quentin Cappart
,
Emmanuel Goutierre
,
David Bergman
,
Louis-Martin Rousseau
Association for the Advancement of Artificial Intelligence (AAAI), 2019.
PDF
Cite
Reinforced Imitation in Heterogeneous Action Space
Imitation learning is an effective alternative approach to learn a policy when the reward function is sparse. In this paper, we …
Konrad Zolna
,
Negar Rostamzadeh
,
Yoshua Bengio
,
Sungjin Ahn
,
Pedro O. Pinheiro
Workshop at the Neural Information Processing Systems (NeurIPS), 2018.
PDF
Cite
LTL and Beyond: Formal Languages for Reward Function Specification in Reinforcement Learning
In Reinforcement Learning (RL), an agent is guided by the rewards it receives from the reward function. Unfortunately, it may take many …
Alberto Camacho
,
Rodrigo Toro Icarte
,
Toryn Q. Klassen
,
Richard Valenzano
,
Sheila A. McIlraith
International Join Conference on Artificial Intelligence (IJCAI), 2018.
PDF
Cite
Teaching Multiple Tasks to an RL Agent using LTL
This paper examines the problem of how to teach multiple tasks to a Reinforcement Learning (RL) agent. To this end, we use Linear …
Rodrigo Toro Icarte
,
Toryn Q. Klassen
,
Richard Valenzano
,
Sheila A. McIlraith
Autonomous Agents and Multiagent Systems (AAMAS), 2018.
PDF
Cite
Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning
In this paper we propose Reward Machines – a type of finite state machine that supports the spec- ification of reward functions while …
Rodrigo Toro Icarte
,
Toryn Q. Klassen
,
Richard Valenzano
,
Sheila A. McIlraith
International Conference on Machine Learning (ICML), 2018.
PDF
Cite
Advice-Based Exploration in Model-Based Reinforcement Learning
Convergence to an optimal policy using model-based rein- forcement learning can require significant exploration of the environment. In …
Rodrigo Toro Icarte
,
Toryn Q. Klassen
,
Richard Valenzano
,
Sheila A. McIlraith
Canadian Conference on AI, 2018.
PDF
Cite
«
Cite
×