← Back to Quantization

mixed int4/int6/int8

Quantization
Used in
1 PRs
Best BPB
1.0785
Avg BPB
1.0785

Hyperparameters Across PRs

pr_numberbitsscope
1731embeddings, attention, MLP, residuals