← Back to Architecture
Transformer depth/width
ArchitectureUsed in
1 PRs
Best BPB
1.6660
Avg BPB
1.6660
Submissions
Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 240 | {"layers":7,"model_dim":512} |