← Back to Architecture

decoder depth

Architecture
Used in
1 PRs
Best BPB
1.5207
Avg BPB
1.5207

Hyperparameters Across PRs

pr_numberparameters
1434{"encoder_layers":0}