Accueil
Équipe
Publications
Open Source
Démos
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow recherche
Tags
Off-policy Learning
ServiceNow recherche
Off-policy Learning
Bridging the Gap Between Target Networks and Functional Regularization
Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …
Alexandre Piche
,
Valentin Thomas
,
Joseph Marino
,
Gian Maria Marconi
,
Mohammad Emtiyaz Khan
,
Christopher Pal
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Citation
Code
Citation
×