← Back to Quantization

mixed int8/fp16 with custom codebook quantization

Quantization
Used in
1 PRs
Best BPB
1.0487
Avg BPB
1.0487

Hyperparameters Across PRs

pr_numberbitsscope
5328all weights except tied embeddings; per-tensor codebook levels for MLP/QKV/proj