← Back to Quantization
mixed int6/fp16
QuantizationUsed in
1 PRs
Best BPB
1.1632
Avg BPB
1.1632
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 66 | 6 | MLP and attention weights int6, tied embedding fp16 passthrough |