mixed int5/int6 with QAT
Category: Quantization
Used in: 1 PR
Best BPB: 1.4222
Avg BPB: 1.4222
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 619 | — | int5 for MLP weights, int6 for attention/bigram-sensitive weights |
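
The scope in the table (int5 for MLP weights, int6 for the more sensitive weights) can be illustrated with a small sketch. This is not the code from PR 619, which is not shown here. It assumes symmetric per-tensor scaling and a name-based rule for picking the bit width; `bits_for`, `fake_quantize`, and the parameter names are hypothetical. In QAT the quantize-then-dequantize step typically runs in the forward pass while gradients pass through the rounding unchanged (a straight-through estimator), which is also an assumption about this submission.

```python
def bits_for(name: str) -> int:
    """Hypothetical rule: int5 for MLP weights, int6 for everything else."""
    return 5 if "mlp" in name else 6


def fake_quantize(weights, bits):
    """Symmetric per-tensor quantize-then-dequantize (assumed scheme).

    Returns floats snapped to the nearest level of a signed `bits`-bit grid.
    """
    qmax = 2 ** (bits - 1) - 1  # 15 for int5, 31 for int6
    max_abs = max((abs(w) for w in weights), default=0.0)
    if max_abs == 0.0:
        return list(weights)  # all zeros (or empty): nothing to scale
    scale = max_abs / qmax
    out = []
    for w in weights:
        q = round(w / scale)          # nearest integer level
        q = max(-qmax, min(qmax, q))  # clamp to the representable range
        out.append(q * scale)         # dequantize back to float
    return out


# Hypothetical parameter names and values, for illustration only.
params = {
    "block0.mlp.fc.weight": [0.80, -0.31, 0.05, -0.77],
    "block0.attn.qkv.weight": [0.42, -0.40, 0.11, 0.02],
}
quantized = {name: fake_quantize(w, bits_for(name)) for name, w in params.items()}
```

With round-to-nearest, each weight moves by at most half a quantization step, so the int6 tensors keep roughly twice the resolution of the int5 ones relative to their largest magnitude.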