loop embeddings

Architecture
Used in: 2 PRs
Best BPB: 1.0788
Avg BPB: 1.1752

Hyperparameters Across PRs

pr_number    parameters
319          {"num_loops":3}
1518         {"num_embeddings":3,"dimension":512,"init":"zero"}