HIGGS

ISTA-DASLab's Collections

updated 14 days ago

Models prequantized with the [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization method. Running them requires the latest `transformers` release.
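A minimal sketch of how one of these checkpoints might be loaded, assuming a `transformers` release with HIGGS support, the `accelerate` package (needed for `device_map="auto"`), and a CUDA GPU. The helper names `higgs_repo_id`, `load_higgs_model`, and `generate_reply` are hypothetical and only illustrate the naming pattern of the repositories in this collection. Extra kernel packages may also be required, so check the `transformers` quantization documentation for your version.

```python
ORG = "ISTA-DASLab"  # organization that hosts this collection


def higgs_repo_id(base: str, bits: int, gptq: bool = False) -> str:
    """Build a repo ID following the naming pattern seen in this collection.

    Hypothetical helper: it does not check that the repository exists.
    """
    if bits not in (3, 4):
        raise ValueError("this collection only lists 3-bit and 4-bit models")
    method = "HIGGS-GPTQ" if gptq else "HIGGS"
    return f"{ORG}/{base}-{method}-{bits}bit"


def load_higgs_model(repo_id: str):
    """Load tokenizer and prequantized model (assumes HIGGS-capable transformers)."""
    # Imported lazily so higgs_repo_id works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # The quantization settings are read from the checkpoint's own config.
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")
    return tokenizer, model


def generate_reply(repo_id: str, prompt: str, max_new_tokens: int = 32) -> str:
    """Generate a short continuation of the prompt with the given checkpoint."""
    tokenizer, model = load_higgs_model(repo_id)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

For example, `generate_reply(higgs_repo_id("Llama-3.1-8B-Instruct", bits=4), "Hello")` would download and run the 4-bit instruct model. Not every combination the helper can build is published here (the GPTQ variants cover only some base models), so compare the resulting ID against the collection before loading.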

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Paper • 2411.17525 • Published Nov 26, 2024

ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-GPTQ-4bit

Updated Dec 12, 2024 • 46 • 2
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-4bit

Text Generation • Updated Dec 10, 2024 • 9
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-3bit

Text Generation • Updated Dec 10, 2024 • 19
ISTA-DASLab/Llama-3.1-8B-HIGGS-GPTQ-4bit

Text Generation • Updated Dec 10, 2024 • 12
ISTA-DASLab/Llama-3.1-8B-HIGGS-GPTQ-3bit

Text Generation • Updated Dec 10, 2024 • 22
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-4bit

Text Generation • Updated Dec 9, 2024 • 29 • 3
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-3bit

Text Generation • Updated Dec 6, 2024 • 9
ISTA-DASLab/Llama-3.1-70B-Instruct-HIGGS-4bit

Text Generation • Updated Dec 6, 2024 • 8
ISTA-DASLab/Llama-3.1-70B-Instruct-HIGGS-3bit

Text Generation • Updated Dec 6, 2024 • 7
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-4bit

Text Generation • Updated Dec 6, 2024 • 30
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-3bit

Text Generation • Updated Dec 6, 2024 • 11
ISTA-DASLab/Llama-3.1-8B-HIGGS-4bit

Text Generation • Updated Dec 6, 2024 • 38
ISTA-DASLab/Llama-3.1-8B-HIGGS-3bit

Text Generation • Updated Dec 6, 2024 • 29
ISTA-DASLab/gemma-2-9b-it-HIGGS-4bit

Text Generation • Updated Dec 6, 2024 • 11
ISTA-DASLab/gemma-2-9b-it-HIGGS-3bit

Text Generation • Updated Dec 6, 2024 • 12
ISTA-DASLab/gemma-2-9b-HIGGS-4bit

Text Generation • Updated Dec 6, 2024 • 14
ISTA-DASLab/gemma-2-9b-HIGGS-3bit

Text Generation • Updated Dec 6, 2024 • 12