Mixed int6/int8 with STE (straight-through estimator)

Quantization
Used in: 1 PR
Best BPB: 1.2421
Avg BPB: 1.2421

Hyperparameters Across PRs

pr_number  bits  scope
370        6     all weight matrices and embeddings
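
For orientation, here is a minimal sketch of what fake quantization with a straight-through estimator typically looks like. It is not taken from the PR: the function names, the symmetric per-tensor scaling, and the rule for choosing which tensors get 6 bits versus 8 bits are all assumptions made for illustration. The forward pass snaps weights to a signed integer grid and dequantizes them; the backward pass treats rounding as the identity so gradients reach the full-precision weights.

```python
def fake_quantize(weights, bits):
    """Round weights to a signed `bits`-bit grid, then map back to floats.

    Assumption: symmetric per-tensor scaling (the PR may use another scheme).
    Returns the dequantized weights and the scale that was used.
    """
    qmax = 2 ** (bits - 1) - 1  # 31 for int6, 127 for int8
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights), 1.0  # all zeros: nothing to quantize
    scale = max_abs / qmax
    dequantized = [
        max(-qmax, min(qmax, round(w / scale))) * scale for w in weights
    ]
    return dequantized, scale


def ste_backward(grad_output):
    """Straight-through estimator: rounding has zero gradient almost
    everywhere, so the backward pass copies the incoming gradient unchanged."""
    return list(grad_output)


def quantize_mixed(tensors, bits_by_name, default_bits=8):
    """Apply a per-tensor bit width. The name-to-bits mapping is hypothetical."""
    return {
        name: fake_quantize(w, bits_by_name.get(name, default_bits))[0]
        for name, w in tensors.items()
    }
```

A call such as `quantize_mixed({"embed": [0.3, -1.2], "norm": [1.0, 0.9]}, {"embed": 6})` would quantize `embed` to 6 bits and leave `norm` at the 8-bit default. In a real training loop the same idea is usually expressed through an autograd framework rather than by hand.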