Accueil
Équipe
Publications
Open source
Démos
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow IA recherche
Publication_types
2
ServiceNow IA recherche
2
Bridging the Gap Between Target Networks and Functional Regularization
Target networks are at the core of recent success in Reinforcement Learning. They stabilize the training by using old parameters to …
Alexandre Piche
,
Valentin Thomas
,
Joseph Marino
,
Gian Maria Marconi
,
Mohammad Emtiyaz Khan
,
Christopher Pal
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Citation
Using Confounded Data in Latent Model-Based Reinforcement Learning
In the presence of confounding, naively using off-the-shelf offline reinforcement learning (RL) algorithms leads to sub-optimal …
Maxime Gasse
,
Damien Grasset
,
Pierre-Yves Oudeyer
,
Guillaume Gaudron
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Citation
StarCoder: may the source be with you!
The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code …
Raymond Li
,
Loubna Ben Allal
,
Yangtian Zi
,
Denis Kocetkov
,
Chenghao Mou
,
Christopher Akiki
,
Jia Li
,
Jenny Chim
,
Terry Yue Zhuo
,
Thomas Wang
,
Mishig Davaadorj
,
João Monteiro
,
Oleh Shliazhko
,
Nicolas Gontier
,
Nicholas Meade
,
Ming-Ho Yee
,
Logesh Kumar Umapathi
,
Benjamin Lipkin
,
Zhiruo Wang
,
Rudra Murthy
,
Jason Stillerman
,
Siva Sankalp Patel
,
Dmitry Abulkhanov
,
Marco Zocca
,
Zhihan Zhang
,
Nour Fahmy
,
Urvashi Bhattacharyya
,
Swayam Singh
,
Sasha Luccioni
,
Paulo Villegas
,
Maxim Kunakov
,
Fedor Zhdanov
,
Manuel Romero
,
Tony Lee
,
Nadav Timor
,
Jennifer Ding
,
Claire Schlesinger
,
Hailey Schoelkopf
,
Jan Ebert
,
Jennifer Robinson
,
Carolyn Jane Anderson
,
Brendan Dolan-Gavitt
,
Danish Contractor
,
Siva Reddy
,
Daniel Fried
,
Dzmitry Bahdanau
,
Yacine Jernite
,
Carlos Muñoz Ferrandis
,
Sean Hughes
,
Thomas Wolf
,
Arjun Guha
,
Leandro von Werra
,
Harm de Vries
,
Joel Lamy Poirier
,
Alex Gu
,
Armel Zebaze
,
Jian Zhu
,
Manan Dey
,
Marc Marone
,
Mayank Mishra
,
Muhtasham Oblokulov
,
Olivier Dehaene
,
Qian Liu
,
Tri Dao
,
Wenhao Yu
,
Niklas Muennighoff
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Citation
Knowledge Hypergraph Embedding Meets Relational Algebra
Embedding-based methods for reasoning in knowledge hypergraphs learn a representation for each entity and relation. Current methods do …
Bahare Fatemi
,
Perouz Taslakian
,
David Vazquez
,
David Poole
Journal of Machine Learning Research (JMLR), 2023.
PDF
Citation
Towards Learning to Imitate from a Single Video Demonstration
Agents that can learn to imitate given video observation – without direct access to state or action information are more …
Glen Berseth
,
Florian Golemo
,
Christopher Pal
Journal of Machine Learning Research (JMLR), 2023.
PDF
Citation
Workflow discovery in low data regimes
Text-based dialogues are now widely used to solve real-world problems. In cases where solution strategies are already known, they can …
Amine El Hattami
,
Issam H. Laradji
,
Stefania Raimondo
,
David Vazquez
,
Pau Rodriguez
,
Christopher Pal
Transactions on Machine Learning Research, 2023.
PDF
Citation
Advancing ethics review practices in AI research
The implementation of ethics review processes is an important first step for anticipating and mitigating the potential harms of AI …
Madhulika Srikumar
,
Grace Abuhamad
,
Joelle Pineau
,
Rebecca Finlay
,
Carolyn Ashurst
,
Rosie Campbell
,
Emily Campbell-Ratcliffe
,
Hudson Hongo
,
Sarah R. Jordan
,
Joseph Lindley
,
Aviv Ovadya
Nature Machine Intelligence, 2022.
PDF
Citation
Does entity abstraction help generative Transformers reason?
We study the utility of incorporating entity type abstractions into pre-trained Transformers and test these methods on four NLP tasks …
Nicolas Gontier
,
Siva Reddy
,
Christopher Pal
Transactions on Machine Learning Research, 2022.
PDF
Citation
The Stack: 3 TB of permissively licensed source code
Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)–not only for natural …
Denis Kocetkov
,
Raymond Li
,
Loubna Ben Allal
,
Jia Li
,
Chenghao Mou
,
Carlos Muñoz Ferrandis
,
Yacine Jernite
,
Margaret Mitchell
,
Sean Hughes
,
Thomas Wolf
,
Dzmitry Bahdanau
,
Leandro von Werra
,
Harm de Vries
Transactions on Machine Learning Research (TMLR), 2022.
PDF
Citation
A Closer Look at Embedding Propagation for Manifold Smoothing
Supervised training of neural networks requires a large amount of manually annotated data and the resulting networks tend to be …
Diego Velazquez
,
Pau Rodriguez
,
Josep M. Gonfaus
,
F. Xavier Roca
,
Jordi Gonzalez
Journal of Machine Learning Research (JMLR), 2022.
PDF
Citation
«
»
Citation
×