← Back to Architecture

TRN hybrid

Architecture
Used in
1 PRs
Best BPB
1.4942
Avg BPB
1.4942

Hyperparameters Across PRs

pr_numberparameters
669{"layers":10,"trn_layers":7,"attention_layers":3}