NTK-aware RoPE scaling

Evaluation
Used in: 1 PR
Best BPB: 1.2160
Avg BPB: 1.2160

Hyperparameters Across PRs

pr_number  parameters
59         {"train_length":1024,"eval_length":2048}
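
For context on what these two lengths control, here is a minimal sketch of the commonly cited NTK-aware rule, which enlarges the RoPE base by scale^(d / (d - 2)), where scale = eval_length / train_length and d is the head dimension. This page does not show how PR 59 implements the technique, so this is an illustration under assumptions and not that PR's code: the function names `ntk_scaled_base` and `rope_inv_freq`, the base of 10000, and the head dimension of 64 are all hypothetical choices.

```python
import math


def ntk_scaled_base(base: float, head_dim: int,
                    train_length: int, eval_length: int) -> float:
    """Return a RoPE base enlarged by the NTK-aware rule.

    Hypothetical helper: the name and signature are illustrative only.
    """
    # No scaling is applied when evaluating at or below the training length.
    scale = max(1.0, eval_length / train_length)
    return base * scale ** (head_dim / (head_dim - 2))


def rope_inv_freq(base: float, head_dim: int) -> list[float]:
    """Inverse rotary frequencies, one per pair of channels."""
    return [base ** (-2 * i / head_dim) for i in range(head_dim // 2)]


# Lengths taken from the table above; base and head_dim are assumptions.
BASE, HEAD_DIM = 10000.0, 64
new_base = ntk_scaled_base(BASE, HEAD_DIM, train_length=1024, eval_length=2048)

old_freq = rope_inv_freq(BASE, HEAD_DIM)
new_freq = rope_inv_freq(new_base, HEAD_DIM)

# The fastest-rotating pair is untouched; the slowest is slowed by the scale.
print(new_base, old_freq[-1] / new_freq[-1])
```

With these lengths the scale is 2, so the slowest-rotating channel pair is stretched by a factor of 2 while the fastest pair keeps its original frequency. Interpolating the low frequencies and preserving the high ones is the point of the exponent d / (d - 2).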