← Back to Architecture
Sparse Attention Gate
ArchitectureUsed in
3 PRs
Best BPB
1.0586
Avg BPB
1.0647
Submissions
Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 1855 | {"gate_window":12,"scale":0.5} |
| 1953 | — |
| 2088 | — |