mixed int8/int6
Quantization
Used in: 3 PRs
Best BPB: 1.0788
Avg BPB: 1.2910
Hyperparameters Across PRs
| PR number | Bits | Scope |
|---|---|---|
| 172 | 8 | attention int8, MLP int6 |
| 435 | — | selected weights |
| 1718 | 8 | control tensors, small matrices, tok_emb |
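As a rough illustration of the PR 172 row (attention int8, MLP int6), the following is a minimal sketch of choosing a bit width per tensor and quantizing to it. The tensor names, the `bits_for` helper, the int8 default for other tensors, and the symmetric per-row scaling are all assumptions made for this example; the PRs listed above may use a different scheme.

```python
from typing import List, Tuple


def bits_for(name: str) -> int:
    """Pick a bit width from a tensor name (hypothetical naming scheme)."""
    if "attn" in name:
        return 8  # attention weights stay at int8
    if "mlp" in name:
        return 6  # MLP weights are reduced to int6
    return 8      # assumption: everything else defaults to int8


def quantize_row(row: List[float], bits: int) -> Tuple[List[int], float]:
    """Symmetric per-row quantization to signed `bits`-bit integers."""
    qmax = 2 ** (bits - 1) - 1  # 127 for int8, 31 for int6
    max_abs = max(abs(x) for x in row)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer level and clamp to [-qmax, qmax].
    q = [max(-qmax, min(qmax, round(x / scale))) for x in row]
    return q, scale


def dequantize_row(q: List[int], scale: float) -> List[float]:
    """Map integer levels back to approximate float values."""
    return [v * scale for v in q]


if __name__ == "__main__":
    row = [1.0, -1.0, 0.0, 0.3]
    for name in ("blocks.0.attn.q_proj", "blocks.0.mlp.fc"):
        bits = bits_for(name)
        q, scale = quantize_row(row, bits)
        print(name, bits, q, dequantize_row(q, scale))
```

With fewer bits the integer grid is coarser, so the int6 tensors trade a larger rounding error (at most half a scale step per weight) for a smaller stored size.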