← Back to Quantization
Full GPTQ
QuantizationUsed in
2 PRs
Best BPB
1.1175
Avg BPB
1.1189
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 535 | 6 | all weights except small tensors and tok_emb.weight (fp16) |
| 569 | 6 | all large weights (MLP, attention, bigram, VE projections); int8 for embeddings |