ServiceNow recherche

Off-policy Learning

Bridging the Gap Between Target Networks and Functional Regularization
Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …