← Back to Architecture

ReLU^2 MLP

Architecture
Used in
1 PRs
Best BPB
1.8480
Avg BPB
1.8480

Hyperparameters Across PRs

pr_numberparameters
220