About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow Research
People
Reza Asad
ServiceNow Research
Reza Asad
Publications
Fast Convergence of Softmax Policy Mirror Ascent
.
Reza Asad
,
Reza Babanezhad
,
Issam H. Laradji
,
Nicolas Le Roux
,
Sharan Vaswani
. At
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
PDF
Cite
Fast Convergence of Softmax Policy Mirror Ascent for Bandits & Tabular MDPs
.
Issam H. Laradji
,
Reza Asad
,
Sharan Vaswani
. At
Workshop at the Neural Information Processing Systems (NeurIPS), 2024.
PDF
Cite
Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism
.
Reza Asad
,
Reza Babanezhad
,
Issam H. Laradji
,
Nicolas Le Roux
,
Sharan Vaswani
. At
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
Cite
×