← Back to Architecture

MLP×5

Architecture
Used in
1 PRs
Best BPB
1.1454
Avg BPB
1.1454

Hyperparameters Across PRs

pr_numberparameters
420{"mlp_mult":5}