Accueil
Équipe
Publications
Évènements
Blog
Carrières
Nous joindre
Français
Français
English
ServiceNow
ServiceNow IA recherche
Tags
Large Language Models
ServiceNow IA recherche
Large Language Models
Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels
We present a simple meta quantization approach that quantizes different layers of a large language model (LLM) at different bit levels, …
Razvan-Gabriel Dumitru
,
Vikas Yadav
,
Rishabh Maheshwary
,
Paul-Ioan Clotan
,
Sathwik Tejaswi Madhusudhan
,
Mihai Surdeanu
ArXiv, 2024.
Article
Citation
Code
LitLLM: A Toolkit for Scientific Literature Review
Literature reviews are an essential component of scientific research. We explore the zero-shot abilities of recent large language …
Shubham Agarwal
,
Abhay Puri
,
Issam H. Laradji
,
Laurent Charlin
,
Christopher Pal
ArXiv, 2024.
Article
Citation
Code
M-RewardBench: Evaluating Reward Models in Multilingual Settings
Reward models (RMs) have driven the state-of-the-art performance of LLMs today by enabling the integration of human feedback into the …
Srishti Gureja
,
Lester James V. Miranda
,
Shayekh Bin Islam
,
Rishabh Maheshwary
,
Drishti Sharma
,
Gusti Winata
,
Nathan Lambert
,
Sebastian Ruder
,
Sara Hooker
,
Marzieh Fadaee
ArXiv, 2024.
Article
Citation
MixSumm: Topic-based Data Augmentation using LLMs for Low-resource Extractive Text Summarization
Low-resource extractive text summarization is a vital but heavily underexplored area of research. Prior literature either focuses on …
Issam H. Laradji
,
Gaurav Sahu
ArXiv, 2024.
Article
Citation
StarCoder 2 and The Stack v2: The Next Generation
The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code …
Anton Lozhkov
,
Raymond Li
,
Loubna Ben Allal
,
Federico Cassano
,
Joel Lamy Poirier
,
Nouamane Tazi
,
Ao Tang
,
Dmytro Pykhtar
,
Jiawei Liu
,
Yuxiang Wei
,
Tianyang Liu
,
Max Tian
,
Denis Kocetkov
,
Arthur Zucker
,
Younes Belkada
,
Zijian Wang
,
Dmitry Abulkhanov
,
Indraneil Paul
,
Zhuang Li
,
Wen-Ding Li
,
Megan Risdal
,
Jia Li
,
Terry Yue Zhuo
,
Nii Osae Osae Dade
,
Lucas Krauß
,
Naman Jain
,
Yixuan Su
,
Xuanli He
,
Edoardo Abati
,
Yekun Chai
,
Xiangru Tang
,
Christopher Akiki
,
Chenghao Mou
,
Binyuan Hui
,
Nicolas Patry
,
Canwen Xu
,
Julian McAuley
,
Han Hu
,
Torsten Scholak
,
Sébastien Paquet
,
Jennifer Robinson
,
Carolyn Jane Anderson
,
Nicolas Chapados
,
Mostofa Patwary
,
Nima Tajbakhsh
,
Yacine Jernite
,
Carlos Muñoz Ferrandis
,
Lingming Zhang
,
Sean Hughes
,
Thomas Wolf
,
Arjun Guha
,
Leandro von Werra
,
Harm de Vries
,
Alex Gu
,
Armel Zebaze
,
Evgenii Zheltonozhskii
,
Jian Zhu
,
Manan Dey
,
Marc Marone
,
Mayank Mishra
,
Muhtasham Oblokulov
,
Olivier Dehaene
,
Qian Liu
,
Tri Dao
,
Wenhao Yu
,
Niklas Muennighoff
ArXiv, 2024.
Article
Citation
Code
Vidéo
StarVector: Generating Scalable Vector Graphics Code from Images and Text
Scalable Vector Graphics (SVGs) have become integral in modern image rendering and graphic design applications due to their infinite …
Juan A. Rodriguez
,
Shubham Agarwal
,
Abhay Puri
,
Issam H. Laradji
,
Sai Rajeswar Mudumba
,
Pau Rodriguez
,
David Vazquez
,
Christopher Pal
,
Marco Pedersoli
ArXiv, 2024.
Article
Citation
TapeAgents: a Holistic Framework for Agent Development and Optimization
We present TapeAgents, an agent framework that leverages a structured, replayable log (tape) of the agent session to facilitate all …
Dzmitry Bahdanau
,
Nicolas Gontier
,
Gabriel Huang
,
Ehsan Kamalloo
,
Rafael Pardinas
,
Alexandre Piche
,
Torsten Scholak
,
Oleh Shliazhko
,
Jordan Prince Tremblay
,
Karam Ghanem
,
Soham Parikh
,
Mitul Tiwari
,
Quaizar Vohra
ArXiv, 2024.
Article
Citation
Code
Vidéo
The BigCode Project Governance Card
This document serves as an overview of the different mechanisms and areas of governance in the BigCode project. It aims to support …
Sean Hughes
,
Harm de Vries
,
Jennifer Robinson
,
Carlos Muñoz Ferrandis
,
Loubna Ben Allal
,
Leandro von Werra
,
Jennifer Ding
,
Sébastien Paquet
,
Yacine Jernite
ArXiv, 2024.
Article
Citation
Capture the Flag: Uncovering Data Insights with Large Language Models
The extraction of a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. …
Issam H. Laradji
,
Perouz Taslakian
,
Sai Rajeswar Mudumba
,
Valentina Zantedeschi
,
Alexandre Lacoste
,
Nicolas Chapados
,
David Vazquez
,
Christopher Pal
,
Alexandre Drouin
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
Article
Citation
Code
Lag-Llama: A Foundation Model for Probabilistic Time Series Forecasting
In this work, we present Lag-Llama, a general-purpose probabilistic time series forecasting model trained on a large collection of time …
Kashif Rasul
,
Arjun Ashok
,
Marin Bilos
,
Andrew Williams
,
Arian Khorasani
,
George Adamopoulos
,
Rishika Bhagwatkar
,
Hena Ghonia
,
Nadhir Hassen
,
Anderson Schneider
,
Sahil Garg
,
Alexandre Drouin
,
Nicolas Chapados
,
Yuriy Nevmyvaka
,
Irina Rish
Workshop at the Neural Information Processing Systems (NeurIPS), 2023.
Article
Citation
Code
«
»
Citation
×