
MLP_MULT reduction

Architecture
Used in: 1 PR
Best BPB: 1.1407
Avg BPB: 1.1407

Hyperparameters Across PRs

pr_number    parameters
845          {"mlp_mult":2.6}
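
As a rough illustration of what the `mlp_mult` setting above might control, the sketch below computes an MLP hidden width and weight count from a multiplier. This page does not define the parameter, so everything here is an assumption: that `mlp_mult` scales the model dimension to give the hidden width, that the product is truncated to an integer, and that the MLP is two bias-free linear layers. The function names and the `model_dim` value of 512 are hypothetical, and the baseline multiplier that 2.6 was reduced from is not stated here.

```python
def mlp_hidden_dim(model_dim: int, mlp_mult: float) -> int:
    # Assumption: hidden width = model_dim * mlp_mult, truncated to an integer.
    return int(model_dim * mlp_mult)


def mlp_param_count(model_dim: int, mlp_mult: float) -> int:
    # Assumption: two bias-free linear layers (up-projection and down-projection),
    # each holding model_dim * hidden weights.
    hidden = mlp_hidden_dim(model_dim, mlp_mult)
    return 2 * model_dim * hidden


model_dim = 512  # hypothetical value, not taken from PR 845
hidden = mlp_hidden_dim(model_dim, 2.6)
params = mlp_param_count(model_dim, 2.6)
print(f"hidden width: {hidden}, MLP weights per block: {params}")
```

Under these assumptions the weight count grows linearly with the multiplier, so lowering `mlp_mult` trades MLP capacity for a smaller parameter budget.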