MLP activation

Architecture
Used in: 6 PRs
Best BPB: 1.2302
Avg BPB: 1.4614

Hyperparameters Across PRs

pr_number  parameters
73
554
675        {"negative_slope":0.5,"power":2}
830        {"negative_slope":0.5}
1299
1434       {"activation":"silu2"}
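
The page lists the parameter values but does not define them. As a rough illustration only, the sketch below shows one plausible reading: negative_slope as the slope of a leaky ReLU on negative inputs, power as an exponent applied to the activation output, and "silu2" as SiLU squared. These interpretations, and the function names leaky_relu_pow and silu_squared, are assumptions made for this example and are not taken from any of the listed PRs.

```python
import json
import math


def leaky_relu_pow(x: float, negative_slope: float = 0.5, power: int = 2) -> float:
    """Leaky ReLU raised to `power` (assumed meaning of the listed parameters)."""
    y = x if x >= 0 else negative_slope * x
    return y ** power


def silu_squared(x: float) -> float:
    """SiLU, x * sigmoid(x), squared (assumed meaning of the "silu2" label)."""
    silu = x / (1.0 + math.exp(-x))
    return silu ** 2


# Parameters copied from the PR 675 row of the table.
params = json.loads('{"negative_slope":0.5,"power":2}')

print(leaky_relu_pow(-2.0, **params))  # (-2.0 * 0.5) ** 2 = 1.0
print(leaky_relu_pow(3.0, **params))   # 3.0 ** 2 = 9.0
print(silu_squared(0.0))               # 0.0
```

Under this reading, an even power makes the output non-negative for every input, so the negative slope only changes how strongly negative inputs contribute.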