← Back to Quantization
mixed int5/int6 GPTQ
QuantizationUsed in
2 PRs
Best BPB
1.0903
Avg BPB
1.0916
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 1260 | — | all layers |
| 1817 | — | flat attention int5, rest int6, embeddings int8 |