← Back to Architecture
GQA + RoPE
ArchitectureUsed in
1 PRs
Best BPB
1.4072
Avg BPB
1.4072
Submissions
Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 377 | {"layers":[0,1,2,3,4]} |
| pr_number | parameters |
|---|---|
| 377 | {"layers":[0,1,2,3,4]} |