Optimization

We analyze the convergence of a novel policy gradient algorithm (referred to as SPMA) for multi-armed bandits and tabular Markov …

International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.

Strategic bidding problems have gained a lot of attention with the introduction of deregulated electricity markets where producers and …

Computers & Operations Research (COR), 2025.

Training large language models (LLMs) for pretraining or adapting to new tasks and domains has become increasingly critical as their …

Issam H. Laradji, Amrutha Ramesh, Mark Schmidt

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

We analyze the convergence of a novel policy gradient algorithm (referred to as SPMA) for multi-armed bandits and tabular Markov …

Issam H. Laradji, Reza Asad, Sharan Vaswani

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

Direct Preference Optimization (DPO) is an effective technique that leverages pairwise preference data (usually one chosen and rejected …

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.

Early Exiting (EE) is a promising technique for speeding up inference at the cost of limited performance loss. It adaptively allocates …

Workshop at the International Conference of Machine Learning (ICML), 2024.

Optimizing large memory-intensive neural networks requires distributing its layers across multiple GPUs (referred to as model …

Workshop at the Neural Information Processing Systems (NeurIPS), 2023.

We investigate the convergence of stochastic mirror descent (SMD) under interpolation in relatively smooth and smooth convex …

Transactions on Machine Learning Research (TMLR), 2023.

Block coordinate descent (BCD) methods are widely used for large-scale numerical optimization because of their cheap iteration costs, …

julie nutini, Issam H. Laradji, Mark Schmidt

International Conference on Machine Learning (ICML), 2023.

We propose a continuous optimization framework for discovering a latent directed acyclic graph (DAG) from observational data. Our …

International Conference of Learning Representations (ICLR), 2023.