← Back to Regularization

adaptive clip

Regularization
Used in
1 PRs
Best BPB
1.0719
Avg BPB
1.0719

Hyperparameters Across PRs

pr_numberparameters
1626{"mlp_sigmas":12,"attn_sigmas":13,"embed_sigmas":15}