← Back to Quantization

mixed int5/int6 with fp16 embeddings

Quantization
Used in
1 PRs
Best BPB
1.2026
Avg BPB
1.2026

Hyperparameters Across PRs

pr_numberbitsscope
426MLP, attention, embeddings