
MLP width multiplier

Architecture
Used in: 1 PR
Best BPB: 1.5248
Avg BPB: 1.5248

Hyperparameters Across PRs

pr_number    parameters
502          {"MLP_MULT":2}
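As a rough illustration of what a width multiplier such as MLP_MULT usually controls, here is a minimal sketch. It assumes the MLP hidden width is MLP_MULT times the model dimension and that the MLP is two bias-free projections; neither detail is confirmed by this page. The function names and the model_dim value of 512 are hypothetical.

```python
import json


def mlp_hidden_width(model_dim: int, mlp_mult: int) -> int:
    """Hidden width of the MLP block, ASSUMING hidden = mlp_mult * model_dim."""
    return mlp_mult * model_dim


def mlp_param_count(model_dim: int, mlp_mult: int) -> int:
    """Weight count of an ASSUMED bias-free MLP: one up- and one down-projection."""
    hidden = mlp_hidden_width(model_dim, mlp_mult)
    return 2 * model_dim * hidden


# Parameters as recorded for PR 502 in the table above.
params = json.loads('{"MLP_MULT":2}')

model_dim = 512  # hypothetical value, not taken from the PR
hidden = mlp_hidden_width(model_dim, params["MLP_MULT"])
weights = mlp_param_count(model_dim, params["MLP_MULT"])
print(f"hidden width: {hidden}, MLP weights per block: {weights}")
```

Under these assumptions the MLP weight count grows linearly with the multiplier, so MLP_MULT trades parameter budget against MLP capacity.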