← Back to Quantization
mixed int8/int6
QuantizationUsed in
4 PRs
Best BPB
1.0788
Avg BPB
1.2652
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 172 | 8 | attention int8, MLP int6 |
| 435 | — | selected weights |
| 1718 | 8 | control tensors, small matrices, tok_emb |
| 1862 | — | model artifact |