About
People
Publications
Open Source
Demos
Events
Blog
Careers
Contact
English
English
Français
ServiceNow
ServiceNow AI Research
People
Mihai Surdeanu
ServiceNow AI Research
Mihai Surdeanu
Publications
Change Is the Only Constant: Dynamic LLM Slicing based on Layer Redundancy
.
Razvan-Gabriel Dumitru
,
Paul-Ioan Clotan
,
Vikas Yadav
,
Darius Peteleaza
,
Mihai Surdeanu
. At
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
PDF
Cite
Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels
.
Razvan-Gabriel Dumitru
,
Vikas Yadav
,
Rishabh Maheshwary
,
Paul-Ioan Clotan
,
Sathwik Tejaswi Madhusudhan
,
Mihai Surdeanu
. At
ArXiv, 2024.
PDF
Cite
Cite
×