Accueil
Équipe
Publications
Open Source
Démos
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow IA recherche
Équipe
Reza Asad
ServiceNow IA recherche
Reza Asad
Publications
Fast Convergence of Softmax Policy Mirror Ascent
.
Reza Asad
,
Reza Babanezhad
,
Issam H. Laradji
,
Nicolas Le Roux
,
Sharan Vaswani
. At
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
PDF
Citation
Fast Convergence of Softmax Policy Mirror Ascent for Bandits & Tabular MDPs
.
Issam H. Laradji
,
Reza Asad
,
Sharan Vaswani
. At
Workshop at the Neural Information Processing Systems (NeurIPS), 2024.
PDF
Citation
Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism
.
Reza Asad
,
Reza Babanezhad
,
Issam H. Laradji
,
Nicolas Le Roux
,
Sharan Vaswani
. At
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
PDF
Citation
Citation
×