← Back to Optimizer
Muon (matrix), Adam (scalar/embed)
OptimizerUsed in
1 PRs
Best BPB
1.1828
Avg BPB
1.1828
Submissions
Hyperparameters Across PRs
| pr_number | weight_decay | momentum | other_params |
|---|---|---|---|
| 599 | — | — | {"matrix_lr":0.02,"scalar_lr":0.02} |