MuonEq-R

Optimizer
Used in: 3 PRs
Best BPB: 1.0679
Avg BPB: 1.0824

Hyperparameters Across PRs

pr_number  weight_decay  momentum  other_params
1326       -             -         -
1334       -             -         {"row_normalized":true}
1485       -             -         {"row_normalized_newton_schulz":true}