MLP2x

Architecture
Used in: 1 PR
Best BPB: 1.2156
Avg BPB: 1.2156

Hyperparameters Across PRs

pr_number    parameters
85           {"mlp_multiplier":2}
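
The page does not define `mlp_multiplier`. As a rough illustration only, the sketch below assumes it scales the MLP hidden width relative to the model width (hidden = multiplier × model dimension) in a two-projection MLP without biases. The function names and the model width of 512 are hypothetical and do not come from PR 85.

```python
import json

def mlp_hidden_dim(d_model: int, mlp_multiplier: int) -> int:
    """Hidden width of the MLP, ASSUMING hidden = multiplier * d_model."""
    return mlp_multiplier * d_model

def mlp_weight_count(d_model: int, mlp_multiplier: int) -> int:
    """Weights in an up-projection plus a down-projection, no biases (assumed layout)."""
    hidden = mlp_hidden_dim(d_model, mlp_multiplier)
    return d_model * hidden + hidden * d_model

# Parameters exactly as listed for PR 85.
params = json.loads('{"mlp_multiplier":2}')

# Hypothetical model width, chosen only for illustration.
d_model = 512

hidden = mlp_hidden_dim(d_model, params["mlp_multiplier"])
weights = mlp_weight_count(d_model, params["mlp_multiplier"])
print(f"hidden width: {hidden}, MLP weights per block: {weights}")
```

Under these assumptions, a multiplier of 2 gives each MLP block half the weights of a multiplier of 4 at the same model width.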