← Back to Test-Time Training

MLP-all TTT

Test-Time Training
Used in
1 PRs
Best BPB
1.1142
Avg BPB
1.1142

Hyperparameters Across PRs

pr_numberparameters
756{"learning_rate":0.002,"epochs":3,"chunk_tokens":32768,"stride":64}