ServiceNow
ServiceNow Research
Tags
Off-policy Learning
Bridging the Gap Between Target Networks and Functional Regularization
Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …
Alexandre Piche, Valentin Thomas, Joseph Marino, Gian Maria Marconi, Mohammad Emtiyaz Khan, Christopher Pal
Transactions on Machine Learning Research (TMLR), 2023.