-
Notifications
You must be signed in to change notification settings - Fork 146
Open
Description
First-Order Error Matters: Accurate Compensation for Quantized Large Language Models.:https://arxiv.org/abs/2507.11017
https://github.com/Xingyu-Zheng/FOEM
FOEM+GPTAQ+SpinQuant will be better.
FOEM can be seamlessly integrated with advanced techniques such as GPTAQ and SpinQuant, yielding additional improvements under
the challenging W4A4KV4 setting, and further narrowing the accuracy gap with
full-precision baselines beyond what current state-of-the-art methods achieve.
Metadata
Metadata
Assignees
Labels
No labels