beta2 decay

Category: LR Schedule
Used in: 1 PR
Best BPB: 1.1349
Avg BPB: 1.1349

Hyperparameters Across PRs

pr_number    parameters
646          {"beta2":0.95,"learning_rate":0.001}
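To make the two recorded values concrete, here is a minimal sketch of how they might enter an optimizer update. It assumes that beta2 is the second-moment decay coefficient of an Adam-style optimizer. The page does not state which optimizer PR 646 uses or how beta2 is scheduled, so the adam_step helper, the beta1 and eps defaults, and the scalar gradient are all hypothetical illustrations.

```python
import json
import math

# Parameters string copied from the table row for PR 646.
params = json.loads('{"beta2":0.95,"learning_rate":0.001}')


def adam_step(param, grad, m, v, t, lr, beta2, beta1=0.9, eps=1e-8):
    """One Adam-style update on a scalar (hypothetical helper).

    beta1 and eps are assumed defaults; they are not recorded on this page.
    """
    m = beta1 * m + (1.0 - beta1) * grad          # first-moment running average
    v = beta2 * v + (1.0 - beta2) * grad * grad   # second-moment running average
    m_hat = m / (1.0 - beta1 ** t)                # bias correction at step t
    v_hat = v / (1.0 - beta2 ** t)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v


# Rough averaging horizon of the second-moment estimate, in steps.
horizon = 1.0 / (1.0 - params["beta2"])

# Single illustrative step from w = 1.0 with an assumed gradient of 0.5.
w, m, v = adam_step(
    1.0, grad=0.5, m=0.0, v=0.0, t=1,
    lr=params["learning_rate"], beta2=params["beta2"],
)
print(horizon, w)
```

Under that assumption, beta2 = 0.95 averages squared gradients over roughly 20 steps, compared with roughly 1000 steps for the widely used default of 0.999, so the second-moment estimate reacts faster to changes in gradient scale.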