FiLM-only TTT

Test-Time Training
Used in: 1 PR
Best BPB: 1.3151
Avg BPB: 1.3151

Hyperparameters Across PRs

pr_number  parameters
1383       {"learning_rate":0.002,"epochs":3,"chunk_tokens":32768,"momentum":0.9}
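
The page records only the hyperparameter values, not how they are used. As a rough illustration, the sketch below shows one way they could drive a test-time adaptation loop. Several things here are assumptions rather than details taken from PR 1383: that "FiLM" means a feature-wise scale and shift (gamma * x + beta), that the optimizer is SGD with momentum, that `epochs` counts passes over each chunk, and that `chunk_tokens` is the size of each adaptation chunk. The toy data, the squared-error loss, and the helper names `split_into_chunks`, `mse`, and `adapt_film` are hypothetical.

```python
import json

# Hyperparameters exactly as listed for PR 1383.
RAW = '{"learning_rate":0.002,"epochs":3,"chunk_tokens":32768,"momentum":0.9}'
cfg = json.loads(RAW)


def split_into_chunks(n_tokens, chunk_tokens):
    """Return (start, end) index pairs that cover n_tokens in order."""
    return [(start, min(start + chunk_tokens, n_tokens))
            for start in range(0, n_tokens, chunk_tokens)]


def mse(x, y, gamma, beta):
    """Mean squared error of the FiLM output gamma * x + beta against y."""
    return sum((gamma * xi + beta - yi) ** 2 for xi, yi in zip(x, y)) / len(x)


def adapt_film(x, y, gamma, beta, cfg):
    """Update only gamma and beta with SGD plus momentum (assumed optimizer).

    Everything else in the model is treated as frozen; in this toy the
    frozen part is simply the identity, so x stands in for its features.
    """
    v_gamma = v_beta = 0.0
    n = len(x)
    for _ in range(cfg["epochs"]):
        residuals = [gamma * xi + beta - yi for xi, yi in zip(x, y)]
        g_gamma = sum(2 * r * xi for r, xi in zip(residuals, x)) / n
        g_beta = sum(2 * r for r in residuals) / n
        # Classical momentum: accumulate a velocity, then step against it.
        v_gamma = cfg["momentum"] * v_gamma + g_gamma
        v_beta = cfg["momentum"] * v_beta + g_beta
        gamma -= cfg["learning_rate"] * v_gamma
        beta -= cfg["learning_rate"] * v_beta
    return gamma, beta


# Toy stand-in for one chunk: targets follow y = 2x + 1.
x = [0.0, 1.0, 2.0, 3.0]
y = [1.0, 3.0, 5.0, 7.0]

before = mse(x, y, 1.0, 0.0)
gamma, beta = adapt_film(x, y, 1.0, 0.0, cfg)
after = mse(x, y, gamma, beta)

chunks = split_into_chunks(100_000, cfg["chunk_tokens"])
print(len(chunks), before, after)
```

With a learning rate of 0.002 and only 3 passes, the two FiLM parameters move a short distance from their starting values, which fits a setup where adaptation is meant to be a light correction rather than a full retraining.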