← Back to Architecture

encoder-decoder split

Architecture
Used in
1 PRs
Best BPB
1.1567
Avg BPB
1.1567

Hyperparameters Across PRs

pr_numberparameters
1380{"num_encoder_layers":1}