ServiceNow Research

Offline Reinforcement Learning

Using Confounded Data in Offline RL
In this work we consider the problem of confounding in offline RL, also referred to as the delusion problem. While it is known that …
A Probabilistic Perspective on Reinforcement Learning via Supervised Learning
Reinforcement Learning via Supervised Learning (RvS) only uses supervised techniques to learn desirable behaviors from large datasets. …