About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow Research
People
Mihai Surdeanu
ServiceNow Research
Mihai Surdeanu
Publications
Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy
.
Razvan-Gabriel Dumitru
,
Paul-Ioan Clotan
,
Vikas Yadav
,
Darius Peteleaza
,
Mihai Surdeanu
. At
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
PDF
Cite
Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels
.
Razvan-Gabriel Dumitru
,
Vikas Yadav
,
Rishabh Maheshwary
,
Paul-Ioan Clotan
,
Sathwik Tejaswi Madhusudhan
,
Mihai Surdeanu
. At
ArXiv, 2024.
PDF
Cite
Code
Cite
×