STE QAT int6

Quantization
Used in: 5 PRs
Best BPB: 1.1502
Avg BPB: 1.1935

Hyperparameters Across PRs

pr_number  bits  scope
66         6     CastedLinear weights / MLP and attention weights
81         6     all weights except tied embeddings
86         6     all block weights
122        6     row-wise weights; embeddings kept in fp16
488        6     all weights
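
For orientation, here is a minimal sketch of what STE QAT at 6 bits with row-wise scales could look like. STE is the straight-through estimator and QAT is quantization-aware training. This is not taken from any of the listed PRs: the function names are hypothetical, the symmetric range [-31, 31] is an assumption (an implementation might use [-32, 31] instead), and pure-Python lists stand in for tensors so the example has no dependencies.

```python
# Illustrative only: names and the symmetric int6 range are assumptions,
# not the code used in the PRs listed above.
QMAX = 31  # assumed symmetric signed 6-bit range [-31, 31]


def fake_quant_row(row, qmax=QMAX):
    """Snap one weight row onto a 6-bit grid, then map it back to floats."""
    # One scale per row; fall back to 1.0 so an all-zero row does not divide by zero.
    scale = max(abs(w) for w in row) / qmax or 1.0
    return [max(-qmax, min(qmax, round(w / scale))) * scale for w in row]


def fake_quant_rowwise(weights):
    """Forward pass: every row is quantized with its own scale."""
    return [fake_quant_row(row) for row in weights]


def ste_backward(grad_output):
    """Backward pass under STE: rounding is treated as the identity,
    so the incoming gradient is passed through unchanged."""
    return [list(row) for row in grad_output]


# Usage: the forward pass sees quantized weights, while the update is
# applied to the full-precision weights using the passed-through gradient.
weights = [[0.5, -1.0, 0.25], [0.0, 0.0, 0.0]]
quantized = fake_quant_rowwise(weights)
grads = ste_backward([[0.1, -0.2, 0.3], [0.0, 0.0, 0.0]])
```

In an autograd framework the same effect is commonly obtained by computing `w + (fake_quant(w) - w)` with the bracketed term detached from the graph, so the forward value is quantized and the gradient with respect to `w` is the identity. Scopes such as "embeddings kept in fp16" would then amount to not applying the fake-quantization step to those parameters.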