← Back to Architecture

MLP3x/4x MLP

Architecture
Used in
1 PRs
Best BPB
1.2392
Avg BPB
1.2392

Hyperparameters Across PRs

pr_numberparameters
436{"mlp_mult":4}