About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
Tags
Large Language Models
ServiceNow AI Research
Large Language Models
The BigCode Project Governance Card
This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support …
Sean Hughes
,
Harm de Vries
,
Jennifer Robinson
,
Carlos Muñoz Ferrandis
,
Loubna Ben Allal
,
Leandro von Werra
,
Jennifer Ding
,
Sébastien Paquet
,
Yacine Jernite
ArXiv, 2024.
PDF
Cite
Capture the Flag: Uncovering Data Insights with Large Language Models
The extraction of a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. …
Issam H. Laradji
,
Perouz Taslakian
,
Sai Rajeswar Mudumba
,
Valentina Zantedeschi
,
Alexandre Lacoste
,
Nicolas Chapados
,
David Vazquez
,
Christopher Pal
,
Alexandre Drouin
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
Code
Lag-Llama: A Foundation Model for Probabilistic Time Series Forecasting
In this work, we present Lag-Llama, a general-purpose probabilistic time series forecasting model trained on a large collection of time …
Kashif Rasul
,
Arjun Ashok
,
Marin Bilos
,
Andrew Williams
,
Arian Khorasani
,
George Adamopoulos
,
Rishika Bhagwatkar
,
Hena Ghonia
,
Nadhir Hassen
,
Anderson Schneider
,
Sahil Garg
,
Alexandre Drouin
,
Nicolas Chapados
,
Yuriy Nevmyvaka
,
Irina Rish
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
Code
The Unsolved Challenges of LLMs in Open-Ended Web Tasks: A Case Study
In this work, we investigate the challenges associated with developing goal-driven AI agents capable of performing open-ended tasks in …
Rim Assouel
,
Tom Marty
,
Massimo Caccia
,
Issam H. Laradji
,
Alexandre Drouin
,
Sai Rajeswar Mudumba
,
Hector Palacios
,
Quentin Cappart
,
David Vazquez
,
Nicolas Chapados
,
Maxime Gasse
,
Alexandre Lacoste
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
Video
LLM aided semi-supervision for efficient Extractive Dialog Summarization
Generating high-quality summaries for chat dialogs often requires large labeled datasets. We propose a method to efficiently use …
Nishant Mishra
,
Gaurav Sahu
,
Iacer Calixto
,
Ameen Abu-Hanna
,
Issam H. Laradji
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
PDF
Cite
MAGNIFICO: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
Humans possess a remarkable ability to assign novel interpretations to linguistic expressions, enabling them to learn new words and …
Arkil Patel
,
Satwik Bhattamishra
,
Siva Reddy
,
Dzmitry Bahdanau
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
PDF
Cite
Code
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Data augmentation is a widely used technique to address the problem of text classification when there is a limited amount of training …
Gaurav Sahu
,
Olga Vechtomova
,
Dzmitry Bahdanau
,
Issam H. Laradji
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
PDF
Cite
Code
Video
Equivariant Adaptation of Large Pre-Trained Models
Equivariant networks are specifically designed to ensure consistent behavior with respect to a set of input transformations, leading to …
Arnab Mondal
,
Siba Smarak Panigrahi
,
Sai Rajeswar Mudumba
,
Siamak Ravanbakhsh
Conference on Neural Information Processing Systems (NeurIPS), 2023.
PDF
Cite
Causal Discovery with Language Models as Imperfect Experts
Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making. In this work, …
Stephanie Long
,
Alexandre Piche
,
Valentina Zantedeschi
,
Tibor Schuster
,
Alexandre Drouin
Workshop on Structured Probabilistic Inference & Generative Modeling (ICML), 2023.
PDF
Cite
Code
Slides
Video
StarCoder: may the source be with you!
The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code …
Raymond Li
,
Loubna Ben Allal
,
Yangtian Zi
,
Denis Kocetkov
,
Chenghao Mou
,
Christopher Akiki
,
Jia Li
,
Jenny Chim
,
Terry Yue Zhuo
,
Thomas Wang
,
Mishig Davaadorj
,
João Monteiro
,
Oleh Shliazhko
,
Nicolas Gontier
,
Nicholas Meade
,
Ming-Ho Yee
,
Logesh Kumar Umapathi
,
Benjamin Lipkin
,
Zhiruo Wang
,
Rudra Murthy
,
Jason Stillerman
,
Siva Sankalp Patel
,
Dmitry Abulkhanov
,
Marco Zocca
,
Zhihan Zhang
,
Nour Fahmy
,
Urvashi Bhattacharyya
,
Swayam Singh
,
Sasha Luccioni
,
Paulo Villegas
,
Maxim Kunakov
,
Fedor Zhdanov
,
Manuel Romero
,
Tony Lee
,
Nadav Timor
,
Jennifer Ding
,
Claire Schlesinger
,
Hailey Schoelkopf
,
Jan Ebert
,
Jennifer Robinson
,
Carolyn Jane Anderson
,
Brendan Dolan-Gavitt
,
Danish Contractor
,
Siva Reddy
,
Daniel Fried
,
Dzmitry Bahdanau
,
Yacine Jernite
,
Carlos Muñoz Ferrandis
,
Sean Hughes
,
Thomas Wolf
,
Arjun Guha
,
Leandro von Werra
,
Harm de Vries
,
Joel Lamy Poirier
,
Alex Gu
,
Armel Zebaze
,
Jian Zhu
,
Manan Dey
,
Marc Marone
,
Mayank Mishra
,
Muhtasham Oblokulov
,
Olivier Dehaene
,
Qian Liu
,
Tri Dao
,
Wenhao Yu
,
Niklas Muennighoff
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Cite
Code
«
»
Cite
×