← Back to Quantization
mixed int6/int8 with GPTQ-lite
QuantizationUsed in
1 PRs
Best BPB
1.1804
Avg BPB
1.1804
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 543 | — | layers 1-9 int6, layers 0 and 10 int8, FP16 embeddings |