learned mixer head
Architecture
Used in
1 PR
Best BPB
0.1582
Avg BPB
0.1582
Submissions
Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 859 | {"input_dim":512,"output_dim":7} |
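The page records only the two dimensions for PR 859, so the following is a minimal sketch of what a head with those shapes could look like, not the submission's actual code. It assumes the head is a single linear projection from a 512-dimensional hidden state to 7 logits, followed by a softmax that yields mixing weights. The softmax, the bias term, the initialisation, and the names `init_head` and `mixer_weights` are all hypothetical; only `input_dim` and `output_dim` come from the table.

```python
import math
import random

INPUT_DIM = 512   # "input_dim" from the table (PR 859)
OUTPUT_DIM = 7    # "output_dim" from the table (PR 859)


def init_head(input_dim, output_dim, seed=0):
    """Create a weight matrix and bias for a linear head (hypothetical init)."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(input_dim)
    weights = [
        [rng.uniform(-scale, scale) for _ in range(input_dim)]
        for _ in range(output_dim)
    ]
    bias = [0.0] * output_dim
    return weights, bias


def mixer_weights(hidden, weights, bias):
    """Project a hidden vector to logits, then softmax into mixing weights.

    Assumption: the head's outputs are normalised with a softmax. The source
    does not say how the 7 outputs are used.
    """
    logits = [
        sum(w * h for w, h in zip(row, hidden)) + b
        for row, b in zip(weights, bias)
    ]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(logit - peak) for logit in logits]
    total = sum(exps)
    return [e / total for e in exps]


weights, bias = init_head(INPUT_DIM, OUTPUT_DIM)
hidden = [0.01] * INPUT_DIM          # placeholder hidden state
mix = mixer_weights(hidden, weights, bias)

# Parameter count for a linear layer with bias: 512 * 7 + 7
param_count = INPUT_DIM * OUTPUT_DIM + OUTPUT_DIM
```

Under these assumptions the head is small: 512 × 7 weights plus 7 biases, or 3,591 parameters in total.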