← Back to Quantization

mixed int6/int5 QAT

Quantization
Used in
1 PRs
Best BPB
1.1227
Avg BPB
1.1227

Hyperparameters Across PRs

pr_numberbitsscope
4176int5 MLP layers, int6 attention