← Back to Evaluation

long context eval

Evaluation
Used in
15 PRs
Best BPB
1.0579
Avg BPB
1.1731

Hyperparameters Across PRs

pr_numberparameters
61{"context_length":1408}
104{"context_length":2048}
113{"context_length":960}
136{"context_length":2048}
831{"cache_tokens":8192,"effective_context":50000}
1103{"context_length":4096}
1154{"context_length":32768}
1863{"context_length":16384}
1953{"context_length":2560}
1963{"context_length":2560}
1978{"context_length":3072}
2006{"context_length":2560}
2007{"context_length":2560}
2019{"context_length":2560}
2060{"eval_length":2560}