← Back to Architecture

layer looping

Architecture
Used in
1 PRs
Best BPB
1.2987
Avg BPB
1.2987

Hyperparameters Across PRs

pr_numberparameters
146{"unique_layers":6,"model_dim":608,"looped_layers":9}