STE QAT int6
Category: Quantization
Used in: 5 PRs
Best BPB: 1.1502
Avg BPB: 1.1935
Submissions
Hyperparameters Across PRs
| PR number | Bits | Scope |
|---|---|---|
| 66 | 6 | CastedLinear weights / MLP and attention weights |
| 81 | 6 | all weights except tied embeddings |
| 86 | 6 | all block weights |
| 122 | 6 | row-wise weights; embeddings kept in fp16 |
| 488 | 6 | all weights |
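
For orientation, here is a minimal sketch of what 6-bit quantization-aware training with a straight-through estimator (STE) could look like for the row-wise scope listed above. It is not taken from any of the listed PRs. The function names (`fake_quant_row`, `ste_forward`, `ste_backward`), the symmetric range of ±31, and the per-row max-abs scale are all illustrative assumptions.

```python
def fake_quant_row(row, bits=6):
    """Quantize one weight row to signed `bits`-bit levels, then dequantize.

    Assumption: symmetric quantization with a per-row max-abs scale.
    """
    qmax = 2 ** (bits - 1) - 1  # 31 for 6 bits
    max_abs = max(abs(w) for w in row)
    if max_abs == 0.0:
        return list(row)  # an all-zero row needs no scaling
    scale = max_abs / qmax
    # Round to the nearest integer level and clamp to [-qmax, qmax].
    levels = [max(-qmax, min(qmax, round(w / scale))) for w in row]
    return [q * scale for q in levels]


def ste_forward(weights, bits=6):
    """Forward pass: the layer sees the fake-quantized weights."""
    return [fake_quant_row(row, bits) for row in weights]


def ste_backward(grad_wrt_quantized):
    """Backward pass: the STE treats rounding as the identity, so the
    gradient with respect to the quantized weights is passed unchanged
    to the full-precision weights."""
    return [list(row) for row in grad_wrt_quantized]


weights = [[0.3, -0.7, 0.05], [1.0, 0.5, -0.25]]
quantized = ste_forward(weights)
grads = ste_backward([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
```

In a real training loop an autograd framework would handle the backward pass. The scope column then amounts to choosing which parameter tensors go through `ste_forward` and which stay in higher precision, such as the embeddings in PR 122.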