← Back to Architecture
SLOT
ArchitectureUsed in
2 PRs
Best BPB
0.7406
Avg BPB
1.0942
Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 1321 | {"hidden_delta_shape":"[bsz, 1, 512]","logit_bias_shape":"[bsz, 1, 1024]"} |
| 1425 | — |