TTT-Linear
Test-Time Training
Used in: 1 PR
Best BPB: 1.1347
Avg BPB: 1.1347
Submissions
Hyperparameters Across PRs
| pr_number | heads | mini_batch | learning_rate |
|---|---|---|---|
| 1166 | 8 | 16 | 1 |
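
As a rough illustration of what these three values control, the sketch below reads the PR 1166 hyperparameters as a JSON record and derives two quantities from them. It assumes that `mini_batch` is the number of tokens per inner-loop update and that `learning_rate` is the inner-loop step size. Neither reading is confirmed by this page. The `describe` helper and the `model_dim=512` and `seq_len=1024` inputs are hypothetical and are not taken from the PR.

```python
import json

# Hyperparameters listed for PR 1166, written as a JSON record.
RAW_PARAMETERS = '{"heads":8,"mini_batch":16,"learning_rate":1}'


def describe(raw, model_dim, seq_len):
    """Derive per-head width and inner-loop update count from one record.

    model_dim and seq_len are hypothetical inputs; neither appears in the table.
    """
    cfg = json.loads(raw)
    if model_dim % cfg["heads"] != 0:
        raise ValueError("model_dim must be divisible by heads")
    # Ceiling division: a trailing partial mini-batch still counts as one update.
    # Assumption: one inner-loop update per mini-batch of tokens.
    updates = -(-seq_len // cfg["mini_batch"])
    return {
        "head_dim": model_dim // cfg["heads"],
        "inner_updates": updates,
        "inner_lr": float(cfg["learning_rate"]),
    }


summary = describe(RAW_PARAMETERS, model_dim=512, seq_len=1024)
print(summary)
```

Under those assumptions, a smaller `mini_batch` means more inner-loop updates per sequence, and more `heads` means a narrower per-head state.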