ServiceNow ServiceNow AI Research Publication_types 9
ServiceNow AI Research
We analyze the convergence of a novel policy gradient algorithm (referred to as SPMA) for multi-armed bandits and tabular Markov …