← Back to Quantization
mixed int6/int5
QuantizationUsed in
7 PRs
Best BPB
1.0924
Avg BPB
1.2110
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 504 | — | mlp, attn, bigram, trigram |
| 1123 | — | Q/K, V/O, MLP, embeddings |
| 1170 | — | MLP layers |
| 1279 | — | all |
| 1543 | — | attention weights and MLP weights |
| 1818 | — | weights and embeddings |
| 1985 | — | block weights |