← Back to Quantization

mixed int6/int5

Quantization
Used in
7 PRs
Best BPB
1.0924
Avg BPB
1.2110

Hyperparameters Across PRs

pr_numberbitsscope
504mlp, attn, bigram, trigram
1123Q/K, V/O, MLP, embeddings
1170MLP layers
1279all
1543attention weights and MLP weights
1818weights and embeddings
1985block weights