← Back to Architecture

MLP2x

Architecture
Used in
2 PRs
Best BPB
1.2156
Avg BPB
1.3143

Hyperparameters Across PRs

pr_numberparameters
85{"mlp_multiplier":2}
1782{"multiplier":2}