ServiceNow Research

Reinforcement Learning

Beyond Target Networks: Improving Deep Q-learning with Functional Regularization
Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …
Direct Behavior Specification via Constrained Reinforcement Learning
The standard formulation of Reinforcement Learning lacks a practical way of specifying what are admissible and forbidden behaviors. …
Unsupervised Model-based Pre-training for Data-efficient Reinforcement Learning from Pixels
Reinforcement learning (RL) aims at autonomously performing complex tasks. To this end, a reward signal is used to steer the learning …
Scaling up ML-based Black-box Planning with Partial STRIPS Models
A popular approach for sequential decision-making is to perform simulator-based search guided with Machine Learning (ML) methods like …
A Probabilistic Perspective on Reinforcement Learning via Supervised Learning
Reinforcement Learning via Supervised Learning (RvS) only uses supervised techniques to learn desirable behaviors from large datasets. …
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Inducing causal relationships from observations is a classic problem in machine learning. Most work in causality starts from the …
Reinforcement Learning with Random Delays
Action and observation delays commonly occur in many Reinforcement Learning applications, such as remote control scenarios. We study …
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal …
Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning
In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint …
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents
As deep reinforcement learning driven by visual perception becomes more widely used there is a growing need to better understand and …