ServiceNow AI Research
Max Tian
Publications
Using Scaling Laws for Data Source Utility Estimation in Domain-Specific Pre-Training.
Oleksiy Ostapenko, Charles Guille-Escuret, Luke Kumar, Max Tian, Denis Kocetkov, Gopeshh Subbaraj, Raymond Li, Joel Lamy Poirier, Sébastien Paquet, Torsten Scholak.
At Conference on Language Modeling Workshops, 2025.
StarCoder 2 and The Stack v2: The Next Generation.
Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Terry Yue Zhuo, Nii Osae Osae Dade, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Edoardo Abati, Yekun Chai, Xiangru Tang, Christopher Akiki, Chenghao Mou, Binyuan Hui, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sébastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Muñoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries, Alex Gu, Armel Zebaze, Evgenii Zheltonozhskii, Jian Zhu, Manan Dey, Marc Marone, Mayank Mishra, Muhtasham Oblokulov, Olivier Dehaene, Qian Liu, Tri Dao, Wenhao Yu, Niklas Muennighoff.
At ArXiv, 2024.