← Back to Quantization

mixed int8/int6

Quantization
Used in
3 PRs
Best BPB
1.0788
Avg BPB
1.2910

Hyperparameters Across PRs

pr_numberbitsscope
1728attention int8, MLP int6
435selected weights
17188control tensors, small matrices, tok_emb