
MLP3x/MLP4x

Architecture
Used in: 1 PR
Best BPB: 1.2417
Avg BPB: 1.2417

Hyperparameters Across PRs

pr_number    parameters
393          {"mlp_multiplier":4}
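
The page does not define mlp_multiplier. A minimal sketch, assuming it follows the common transformer convention in which the MLP hidden width equals the model width times the multiplier (so "MLP4x" would mean a 4x-wide hidden layer). The function name mlp_shapes and the model width of 512 are hypothetical and chosen only for illustration; they do not come from PR 393.

```python
import json


def mlp_shapes(d_model: int, mlp_multiplier: int) -> dict:
    """Shapes of a two-layer MLP block, assuming hidden = d_model * multiplier."""
    hidden = d_model * mlp_multiplier
    return {
        "hidden": hidden,
        "up_proj": (d_model, hidden),       # projects model width up to hidden width
        "down_proj": (hidden, d_model),     # projects hidden width back down
        "weight_params": 2 * d_model * hidden,  # weights only, biases ignored
    }


# The hyperparameter string exactly as recorded for the PR.
params = json.loads('{"mlp_multiplier":4}')

D_MODEL = 512  # hypothetical model width, not taken from the source

shapes_4x = mlp_shapes(D_MODEL, params["mlp_multiplier"])
shapes_3x = mlp_shapes(D_MODEL, 3)  # the MLP3x variant, for comparison

print(shapes_4x)
print(shapes_3x)
```

Under this assumption, going from 3x to 4x adds one third more weights to each MLP block, which is the trade-off against the BPB figures reported above.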