← Back to Test-Time Training
TTT
Test-Time TrainingUsed in
23 PRs
Best BPB
0.0214
Avg BPB
1.0991
Submissions
PR #196by sicauzxl
1.3825PR #212by mrdavtan
1.1329PR #367by ksang123
1.1770PR #371by mrdavtan
1.1401PR #588by andyluo22
1.4120PR #645by FlynnCruse
1.8990PR #646by Upsalla
1.1349PR #651by phulin
1.2093PR #687by RoyiRa
1.0745PR #818by lucamignatti
0.5527PR #901by Hilo-Hilo
1.1590PR #962by AnirudhRahul
0.0214PR #1026by danielxmed
1.0945PR #1058by resouer
1.1247PR #1184by icryo
0.9485PR #1243by simon-marcus
1.1230PR #1250by ibarrajo
1.2094PR #1251by ibarrajo
1.1349PR #1307by amrayach
1.1101PR #1398by Mertyandimata
1.1047PR #1414by Abhishek8108
0.7093PR #1569by abbudjoe
1.3576PR #1578by mikeapedia
1.0668Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 196 | {"run_ttt_eval":1} |
| 212 | {"max_steps":500,"freeze_blocks":1} |
| 367 | {"learning_rate":0.002} |
| 371 | {"epochs":3,"optimizer":"SGD"} |
| 588 | — |
| 645 | — |
| 646 | — |
| 651 | — |
| 687 | {"learning_rate":0.0001,"chunk_tokens":131072,"use_mixer":true} |
| 818 | — |
| 901 | — |
| 962 | {"epochs":0,"freeze_blocks":2,"learning_rate":0.0001} |
| 1026 | — |
| 1058 | {"enabled":false} |
| 1184 | {"enabled":false} |
| 1243 | {"enabled":0} |
| 1250 | — |
| 1251 | — |
| 1307 | {"enabled":false} |
| 1398 | — |
| 1414 | {"variant":"Discriminative TTT","per_block_adaptive_lr":true,"pre_quantization":true} |
| 1569 | {"mode":"off"} |
| 1578 | — |