
matrix learning rate tuning

Category: LR Schedule
Used in: 1 PR
Best BPB: 0.1582
Avg BPB: 0.1582

Hyperparameters Across PRs

pr_number    parameters
859          {"matrix_lr":0.03}