About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow Research
People
Jia Li
ServiceNow Research
Jia Li
Publications
StarCoder 2 and The Stack v2: The Next Generation
.
Anton Lozhkov
,
Raymond Li
,
Loubna Ben Allal
,
Federico Cassano
,
Joel Lamy Poirier
,
Nouamane Tazi
,
Ao Tang
,
Dmytro Pykhtar
,
Jiawei Liu
,
Yuxiang Wei
,
Tianyang Liu
,
Max Tian
,
Denis Kocetkov
,
Arthur Zucker
,
Younes Belkada
,
Zijian Wang
,
Dmitry Abulkhanov
,
Indraneil Paul
,
Zhuang Li
,
Wen-Ding Li
,
Megan Risdal
,
Jia Li
,
Terry Yue Zhuo
,
Nii Osae Osae Dade
,
Lucas Krauß
,
Naman Jain
,
Yixuan Su
,
Xuanli He
,
Edoardo Abati
,
Yekun Chai
,
Xiangru Tang
,
Christopher Akiki
,
Chenghao Mou
,
Binyuan Hui
,
Nicolas Patry
,
Canwen Xu
,
Julian McAuley
,
Han Hu
,
Torsten Scholak
,
Sébastien Paquet
,
Jennifer Robinson
,
Carolyn Jane Anderson
,
Nicolas Chapados
,
Mostofa Patwary
,
Nima Tajbakhsh
,
Yacine Jernite
,
Carlos Muñoz Ferrandis
,
Lingming Zhang
,
Sean Hughes
,
Thomas Wolf
,
Arjun Guha
,
Leandro von Werra
,
Harm de Vries
,
Alex Gu
,
Armel Zebaze
,
Evgenii Zheltonozhskii
,
Jian Zhu
,
Manan Dey
,
Marc Marone
,
Mayank Mishra
,
Muhtasham Oblokulov
,
Olivier Dehaene
,
Qian Liu
,
Tri Dao
,
Wenhao Yu
,
Niklas Muennighoff
. At
ArXiv, 2024.
PDF
Cite
Code
Video
StarCoder: may the source be with you!
.
Raymond Li
,
Loubna Ben Allal
,
Yangtian Zi
,
Denis Kocetkov
,
Chenghao Mou
,
Christopher Akiki
,
Jia Li
,
Jenny Chim
,
Terry Yue Zhuo
,
Thomas Wang
,
Mishig Davaadorj
,
João Monteiro
,
Oleh Shliazhko
,
Nicolas Gontier
,
Nicholas Meade
,
Ming-Ho Yee
,
Logesh Kumar Umapathi
,
Benjamin Lipkin
,
Zhiruo Wang
,
Rudra Murthy
,
Jason Stillerman
,
Siva Sankalp Patel
,
Dmitry Abulkhanov
,
Marco Zocca
,
Zhihan Zhang
,
Nour Fahmy
,
Urvashi Bhattacharyya
,
Swayam Singh
,
Sasha Luccioni
,
Paulo Villegas
,
Maxim Kunakov
,
Fedor Zhdanov
,
Manuel Romero
,
Tony Lee
,
Nadav Timor
,
Jennifer Ding
,
Claire Schlesinger
,
Hailey Schoelkopf
,
Jan Ebert
,
Jennifer Robinson
,
Carolyn Jane Anderson
,
Brendan Dolan-Gavitt
,
Danish Contractor
,
Siva Reddy
,
Daniel Fried
,
Dzmitry Bahdanau
,
Yacine Jernite
,
Carlos Muñoz Ferrandis
,
Sean Hughes
,
Thomas Wolf
,
Arjun Guha
,
Leandro von Werra
,
Harm de Vries
,
Joel Lamy Poirier
,
Alex Gu
,
Armel Zebaze
,
Jian Zhu
,
Manan Dey
,
Marc Marone
,
Mayank Mishra
,
Muhtasham Oblokulov
,
Olivier Dehaene
,
Qian Liu
,
Tri Dao
,
Wenhao Yu
,
Niklas Muennighoff
. At
Transactions on Machine Learning Research (TMLR), 2023.
PDF
Cite
Code
The Stack: 3 TB of permissively licensed source code
.
Denis Kocetkov
,
Raymond Li
,
Loubna Ben Allal
,
Jia Li
,
Chenghao Mou
,
Carlos Muñoz Ferrandis
,
Yacine Jernite
,
Margaret Mitchell
,
Sean Hughes
,
Thomas Wolf
,
Dzmitry Bahdanau
,
Leandro von Werra
,
Harm de Vries
. At
Transactions on Machine Learning Research (TMLR), 2022.
PDF
Cite
Code
Cite
×