BitNet b1.58 ternary quantisation with FP8 QAT

Quantization
Used in: 1 PR
Best BPB: 1.1570
Avg BPB: 1.1570

Hyperparameters Across PRs

pr_number | bits | scope
6401weights ternary {-1,0,+1} with FP8 QAT for fp params
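
As a rough illustration of the ternary {-1,0,+1} weight scheme named in the scope column, here is a minimal NumPy sketch of absmean ternary quantization in the style commonly described for BitNet b1.58. The function names `ternary_quantize` and `dequantize`, the per-tensor scale, and the `eps` value are assumptions made for illustration; the PR's actual code is not shown on this page and may differ. The FP8 QAT handling of the remaining floating-point parameters is not covered here.

```python
import numpy as np

def ternary_quantize(w, eps=1e-8):
    """Map a weight tensor to {-1, 0, +1} plus one per-tensor scale.

    Hypothetical helper: scales by the mean absolute weight, rounds,
    then clips to the ternary range.
    """
    scale = np.mean(np.abs(w)) + eps           # absmean scale (eps avoids division by zero)
    q = np.clip(np.round(w / scale), -1, 1)    # round to nearest integer, clip to [-1, 1]
    return q.astype(np.int8), float(scale)

def dequantize(q, scale):
    """Reconstruct an approximate float tensor from ternary codes."""
    return q.astype(np.float32) * scale

# Toy weight matrix (made-up values, not taken from any PR).
w = np.array([[0.9, -0.05, -1.3],
              [0.2,  0.6,  -0.4]], dtype=np.float32)

q, scale = ternary_quantize(w)
w_hat = dequantize(q, scale)
print(q)
print(scale)
print(w_hat)
```

In quantization-aware training, the forward pass would typically use the dequantized weights while gradients are passed through the rounding step unchanged (a straight-through estimator); that training loop is omitted from this sketch.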