Accueil
Équipe
Publications
Open Source
Démos
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow recherche
Tags
Reinforcement Learning
ServiceNow recherche
Reinforcement Learning
Multimodal foundation world models for generalist embodied agents
Learning generalist agents, able to solve multitudes of tasks in different domains is a long-standing problem. Reinforcement learning …
Pietro Mazzaglia
,
Tim Verbelen
,
Bart Dhoedt
,
Aaron Courville
,
Sai Rajeswar Mudumba
Neural Information Processing Systems (NeurIPS), 2024.
PDF
Citation
Code
Representing Positional Information in Generative World Models for Object Manipulation
The ability to predict outcomes of interactions between embodied agents and objects is paramount in the robotic setting. While …
Stefano Ferraro
,
Pietro Mazzaglia
,
Tim Verbelen
,
Sai Rajeswar Mudumba
Workshop at the Neural Information Processing Systems (NeurIPS), 2024.
PDF
Citation
Self-evaluation and self-prompting to improve the reliability of LLMs
In order to safely deploy Large Language Models (LLMs), they must be capable of dynamically adapting their behavior based on their …
Alexandre Piche
,
Aristides Milios
,
Dzmitry Bahdanau
,
Christopher Pal
Workshop at the International Conference of Learning Representation (ICLR), 2024.
PDF
Citation
Vidéo
Bridging the Gap Between Target Networks and Functional Regularization
Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …
Alexandre Piche
,
Valentin Thomas
,
Joseph Marino
,
Gian Maria Marconi
,
Mohammad Emtiyaz Khan
,
Christopher Pal
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Citation
Code
Using Confounded Data in Latent Model-Based Reinforcement Learning
In the presence of confounding, naively using off-the-shelf offline reinforcement learning (RL) algorithms leads to sub-optimal …
Maxime Gasse
,
Damien Grasset
,
Pierre-Yves Oudeyer
,
Guillaume Gaudron
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Citation
Code
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Controlling artificial agents from visual sensory data is an arduous task. Reinforcement learning (RL) algorithms can succeed but …
Sai Rajeswar Mudumba
,
Pietro Mazzaglia
,
Tim Verbelen
,
Alexandre Piche
,
Bart Dhoedt
,
Aaron Courville
,
Alexandre Lacoste
International Conference on Machine Learning (ICML), 2023.
PDF
Citation
Code
Choreographer: Learning and Adapting Skills in Imagination
Unsupervised skill learning aims to learn a rich repertoire of behaviors without external supervision, providing artificial agents with …
Pietro Mazzaglia
,
Tim Verbelen
,
Bart Dhoedt
,
Alexandre Lacoste
,
Sai Rajeswar Mudumba
International Conference of Learning Representations (ICLR), 2023.
PDF
Citation
Code
Deep Hyperbolic Reinforcement Learning for Continuous Control
Integrating hyperbolic representations with Deep Reinforcement Learning (DRL) has recently been proposed as a promising approach for …
Omar Salemohamed
,
Edoardo Cetin
,
Sai Rajeswar Mudumba
,
Arnab Mondal
ICLR, Tiny Papers, 2023.
PDF
Citation
Leveraging Human Preferences to Master Poetry
Large language models have been fine-tuned to learn poetry via supervised learning on a dataset containing relevant examples. However, …
Rafael Pardinas
,
Gabriel Huang
,
David Vazquez
,
Alexandre Piche
AAAI Workshops, 2023.
PDF
Citation
Haptics-based Curiosity for Sparse-reward Tasks
Robots in many real-world settings have access to force/torque sensors in their gripper and tactile sensing is often necessary in tasks …
Sai Rajeswar Mudumba
,
Cyril Ibrahim
,
Nitin Surya
,
Florian Golemo
,
David Vazquez
,
Aaron Courville
,
Pedro O. Pinheiro
Conference on Robot Learning (CoRL), 2022.
PDF
Citation
Code
»
Citation
×