← Back to Quantization
int5/int6
QuantizationUsed in
3 PRs
Best BPB
1.1417
Avg BPB
1.1659
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 511 | — | — |
| 515 | — | MLP weights (int5), attention weights (int6, per-row scale) |
| 547 | — | MLP matrices (int5), attention matrices (int6), embeddings (int6) |