← Back to Evaluation

test-time compute scaling

Evaluation
Used in
1 PRs
Best BPB
1.5283
Avg BPB
1.5283

Hyperparameters Across PRs

pr_numberparameters
54{"train_passes":3,"inference_passes":[6,8]}