mixed int6/int7/int8

Quantization
Used in: 4 PRs
Best BPB: 1.0554
Avg BPB: 1.0579

Hyperparameters Across PRs

| pr_number | bits | scope |
|-----------|------|-------|
| 2097 | | tok_emb.weight |
| 2132 | | model weights |
| 2162 | | weights, embeddings, attention gate |
| 2164 | | mixed |
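
For orientation, here is a minimal sketch of what a mixed int6/int7/int8 scheme can look like: each tensor gets its own bit width and is quantized symmetrically with a single scale. This is not taken from any of the PRs listed above. The bit assignments in `BITS_BY_TENSOR`, the tensor names other than `tok_emb.weight`, and the helper names are all hypothetical, and the per-tensor round-to-nearest scheme is an assumption chosen for simplicity.

```python
def quantize_symmetric(values, bits):
    """Symmetric round-to-nearest quantization of floats to signed `bits`-bit ints."""
    qmax = 2 ** (bits - 1) - 1  # 31 for int6, 63 for int7, 127 for int8
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Clamp after rounding so every code fits in the signed range [-qmax, qmax].
    codes = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float values."""
    return [c * scale for c in codes]


# Hypothetical assignment: the source table does not list bits per scope.
BITS_BY_TENSOR = {
    "tok_emb.weight": 8,  # name appears in the table; the bit width is assumed
    "attn.weight": 6,     # hypothetical tensor name and bit width
    "mlp.weight": 7,      # hypothetical tensor name and bit width
}


def quantize_model(tensors, bits_by_tensor, default_bits=8):
    """Quantize each named tensor (a flat list of floats) at its assigned width."""
    packed = {}
    for name, values in tensors.items():
        bits = bits_by_tensor.get(name, default_bits)
        codes, scale = quantize_symmetric(values, bits)
        packed[name] = (bits, codes, scale)
    return packed
```

Calling `quantize_model({"tok_emb.weight": [0.5, -1.0, 0.25]}, BITS_BY_TENSOR)` returns, per tensor, the bit width used, the integer codes, and the scale needed by `dequantize`. Real implementations typically use per-row or per-group scales and pack the 6- and 7-bit codes tightly; both are omitted here.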