← Back to Quantization

mixed int5/int6 GPTQ

Quantization
Used in
2 PRs
Best BPB
1.0903
Avg BPB
1.0916

Hyperparameters Across PRs

pr_numberbitsscope
1260all layers
1817flat attention int5, rest int6, embeddings int8