← Back to Architecture

depth-scaled residual

Architecture
Used in
2 PRs
Best BPB
0.7853
Avg BPB
0.9689

Hyperparameters Across PRs

pr_numberparameters
568
633