ServiceNow AI Research

Theory of Machine Learning

Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism
Optimizing large, memory-intensive neural networks requires distributing their layers across multiple GPUs (referred to as model …
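To make the setting concrete: in model parallelism, consecutive blocks of layers live on different devices, and activations must cross device boundaries during the forward pass. Below is a minimal PyTorch sketch of that setup only, not the paper's surrogate-minimization algorithm; the two-block split, layer sizes, and device IDs are assumptions chosen for illustration.

```python
import torch
import torch.nn as nn

class TwoGPUNet(nn.Module):
    """Minimal model parallelism: each block of layers lives on its own GPU."""
    def __init__(self):
        super().__init__()
        # First block of layers on GPU 0, second block on GPU 1.
        self.block1 = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU()).to("cuda:0")
        self.block2 = nn.Linear(4096, 10).to("cuda:1")

    def forward(self, x):
        # Activations are moved between devices at the block boundary.
        h = self.block1(x.to("cuda:0"))
        return self.block2(h.to("cuda:1"))
```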
On Stochastic Mirror Descent: Convergence Analysis and Adaptive Variants
We investigate the convergence of stochastic mirror descent (SMD) under interpolation in relatively smooth and smooth convex …
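As a concrete instance of SMD (a sketch of the classic method, not the paper's adaptive variants): with the negative-entropy mirror map over the probability simplex, the mirror step is the exponentiated-gradient update w ← w · exp(−η g) followed by renormalization. The least-squares example below is interpolated by construction, i.e. every stochastic loss is minimized at the same point, matching the interpolation setting the abstract mentions.

```python
import numpy as np

def smd_simplex(stoch_grad, w0, steps=2000, eta=0.1, seed=0):
    """Stochastic mirror descent over the simplex with the negative-entropy
    mirror map, i.e. exponentiated gradient: w <- w * exp(-eta * g) / Z."""
    rng = np.random.default_rng(seed)
    w = w0.copy()
    for _ in range(steps):
        g = stoch_grad(w, rng)      # stochastic gradient at the current point
        w = w * np.exp(-eta * g)    # mirror (dual-space) step
        w /= w.sum()                # normalize back onto the simplex
    return w

# Interpolated least squares: b = A @ w_true, so every per-sample loss
# (a_i @ w - b_i)^2 is zero at w_true.
rng = np.random.default_rng(1)
A = rng.normal(size=(100, 5))
w_true = np.array([0.5, 0.2, 0.1, 0.1, 0.1])
b = A @ w_true

def stoch_grad(w, rng):
    i = rng.integers(len(b))                  # sample one data point
    return 2.0 * (A[i] @ w - b[i]) * A[i]     # gradient of f_i at w

w_hat = smd_simplex(stoch_grad, np.ones(5) / 5)
```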
Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence
Block coordinate descent (BCD) methods are widely used for large-scale numerical optimization because of their cheap iteration costs, …
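For intuition on greedy selection rules, here is a single-coordinate sketch of the classic Gauss-Southwell rule that the paper's faster block rules build on (an illustration, not the paper's method): at each step, update the coordinate whose partial derivative is largest in magnitude, paying only a cheap O(n) gradient refresh per iteration on a quadratic.

```python
import numpy as np

def greedy_cd_quadratic(A, b, steps=500):
    """Greedy (Gauss-Southwell) coordinate descent on
    f(x) = 0.5 * x.T @ A @ x - b @ x, with A symmetric positive definite."""
    x = np.zeros(len(b))
    grad = A @ x - b                     # gradient of f at x
    for _ in range(steps):
        i = np.argmax(np.abs(grad))      # greedy rule: largest partial derivative
        step = grad[i] / A[i, i]         # exact minimization along coordinate i
        x[i] -= step
        grad -= step * A[:, i]           # O(n) gradient update, no full recompute
    return x

rng = np.random.default_rng(0)
M = rng.normal(size=(50, 20))
A = M.T @ M + np.eye(20)                 # symmetric positive definite
x_star = greedy_cd_quadratic(A, rng.normal(size=20))
```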
DAG Learning on the Permutahedron
We propose a continuous optimization framework for discovering a latent directed acyclic graph (DAG) from observational data. Our …
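The key combinatorial object here is a topological order: fixing a permutation of the nodes and keeping only edges that go "forward" in that order yields an acyclic graph by construction, and the framework relaxes the discrete search over permutations to continuous optimization over the permutahedron (their convex hull). A minimal sketch of the order-to-DAG masking step only; the continuous relaxation itself is beyond this snippet.

```python
import numpy as np

def dag_mask_from_order(order):
    """Given a topological order (a permutation of node indices), return the
    binary mask of edges i -> j allowed under that order: rank(i) < rank(j)."""
    rank = np.empty(len(order), dtype=int)
    rank[np.asarray(order)] = np.arange(len(order))
    return (rank[:, None] < rank[None, :]).astype(float)

# Any weighted adjacency masked this way is acyclic by construction.
W = np.random.default_rng(0).normal(size=(4, 4))
order = [2, 0, 3, 1]                      # node 2 first, node 1 last
W_dag = W * dag_mask_from_order(order)
```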
Deep Hyperbolic Reinforcement Learning for Continuous Control
Integrating hyperbolic representations with Deep Reinforcement Learning (DRL) has recently been proposed as a promising approach for …
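One standard way to obtain hyperbolic representations (a sketch of the common embedding step, not necessarily this paper's exact architecture) is to map a network's Euclidean feature vector onto the Poincaré ball via the exponential map at the origin.

```python
import numpy as np

def expmap0(v, c=1.0, eps=1e-8):
    """Exponential map at the origin of the Poincare ball with curvature -c:
    exp_0(v) = tanh(sqrt(c) * ||v||) * v / (sqrt(c) * ||v||).
    Maps Euclidean features into the open ball of radius 1 / sqrt(c)."""
    sqrt_c = np.sqrt(c)
    norm = np.maximum(np.linalg.norm(v, axis=-1, keepdims=True), eps)
    return np.tanh(sqrt_c * norm) * v / (sqrt_c * norm)

z = expmap0(np.random.default_rng(0).normal(size=(4, 8)))
assert (np.linalg.norm(z, axis=-1) < 1.0).all()   # inside the unit ball (c=1)
```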