← Back to Quantization
mixed int6/int5
QuantizationUsed in
5 PRs
Best BPB
1.0924
Avg BPB
1.2534
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 504 | — | mlp, attn, bigram, trigram |
| 1123 | — | Q/K, V/O, MLP, embeddings |
| 1170 | — | MLP layers |
| 1279 | — | all |
| 1543 | — | attention weights and MLP weights |