random basis MLP

Architecture
Used in: 1 PR
Best BPB: 1.2554
Avg BPB: 1.2554

Hyperparameters Across PRs

pr_number  parameters
1684       {"mlp_mult":16,"layers":11,"dim":512}
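The parameters for PR 1684 are stored as a JSON object, so they can be read directly. The sketch below parses that string and derives an MLP hidden width. It assumes, as is common but not confirmed by this page, that mlp_mult is the ratio of the MLP hidden width to dim; the name hidden_width is hypothetical and used only for illustration.

```python
import json

# Parameters string copied from the table row for PR 1684.
raw = '{"mlp_mult":16,"layers":11,"dim":512}'
params = json.loads(raw)

# Assumption (not stated on this page): mlp_mult multiplies the model
# width (dim) to give the hidden width of each MLP block.
hidden_width = params["mlp_mult"] * params["dim"]

print(f"layers={params['layers']}, dim={params['dim']}, hidden_width={hidden_width}")
```

Under that assumption, each of the 11 layers would use an MLP hidden width of 8192. If mlp_mult means something else in the PR, this figure does not apply.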