Pushing the Limits of Large Language Model Quantization via the Linearity Theorem
Paper
•
2411.17525
•
Published
Models prequantized with [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization. Requires the latest `transformers` to run.