← Back to Quantization
FP8
QuantizationUsed in
3 PRs
Best BPB
1.2064
Avg BPB
1.4151
Submissions
Hyperparameters Across PRs
| pr_number | bits | scope |
|---|---|---|
| 739 | 8 | all persistent state (master weights, optimizer momentum) |
| 903 | 8 | embeddings and medium matrices |
| 1388 | 8 | FP parameters and scales |