← Back to Quantization

mixed int6/int5

Quantization
Used in
5 PRs
Best BPB
1.0924
Avg BPB
1.2534

Hyperparameters Across PRs

pr_numberbitsscope
504mlp, attn, bigram, trigram
1123Q/K, V/O, MLP, embeddings
1170MLP layers
1279all
1543attention weights and MLP weights