Reinforcement Learning

I present diffusion models as part of a family of machine learning techniques that withhold information from a model’s input and train …

Pierre-André Noël

ICLR Blogposts 2026, 2026.

Transactions on Machine Learning Research (TMLR), 2026.

Chart understanding is critical for ServiceNow for data analysis and for reasoning over visualizations, such as interpreting trends, identifying …

NOW AI, 2025.

Reinforcement Learning (RL) is increasingly utilized to enhance the reasoning capabilities of Large Language Models (LLMs). However, …

NOW AI, 2025.

In order to be deployed safely, Large Language Models (LLMs) must be capable of dynamically adapting their behavior based on their …

Transactions on Machine Learning Research (TMLR), 2025.

Learning generalist agents able to solve multitudes of tasks in different domains is a long-standing problem. Reinforcement learning …

Neural Information Processing Systems (NeurIPS), 2024.

The ability to predict outcomes of interactions between embodied agents and objects is paramount in the robotic setting. While …

Workshop at the Neural Information Processing Systems (NeurIPS), 2024.

In order to safely deploy Large Language Models (LLMs), they must be capable of dynamically adapting their behavior based on their …

Workshop at the International Conference on Learning Representations (ICLR), 2024.

Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …

Transactions on Machine Learning Research (TMLR), 2023.

In the presence of confounding, naively using off-the-shelf offline reinforcement learning (RL) algorithms leads to sub-optimal …

Transactions on Machine Learning Research (TMLR), 2023.