← Back to Architecture

MLP4

Architecture
Used in
2 PRs
Best BPB
1.3092
Avg BPB
1.3938

Hyperparameters Across PRs

pr_numberparameters
408{"mlp_mult":4}
759