← Back to Architecture

depth embeddings

Architecture
Used in
1 PRs
Best BPB
1.2066
Avg BPB
1.2066

Hyperparameters Across PRs

pr_numberparameters
1472{"logical_layers":16}