stride-based eval

Evaluation
Used in: 62 PRs
Best BPB: 0.0214
Avg BPB: 1.0044

Submissions

PR #162 by raahilshah (RECORD): 1.1458
PR #209 by JWLBOYCE: 1.1624
PR #212 by mrdavtan: 1.1329
PR #285 by DanishjeetSingh: 1.3510
PR #316 by SkywardSyntax: 1.2035
PR #348 by EthanYangTW: 1.1444
PR #393 by CrimsonSithria: 1.2417
PR #410 by EthanYangTW: 1.1216
PR #415 by EthanYangTW: 1.1216
PR #436 by CrimsonSithria: 1.2392
PR #483 by tmustier: 1.1346
PR #526 by Christopher-Lee-McClendon: 1.1425
PR #530 by j420: 1.4963
PR #533 by newjordan: 1.1207
PR #554 by chrisnkuno: 1.4612
PR #588 by andyluo22: 1.4120
PR #598 by Christopher-Lee-McClendon: 1.1334
PR #600 by humanaiconvention: 1.2364
PR #601 by anantdgoel: 1.1418
PR #615 by danialht: 1.1169
PR #626 by kshitizz36: 1.1180
PR #680 by bro4all: 1.1483
PR #686 by msisovic: 1.1182
PR #769 by MatoTeziTanka: 0.8508
PR #776 by agalimova: 0.9258
PR #779 by deanbrr: 0.6683
PR #785 by SirSaltySalmon: 1.5364
PR #790 by danialht: 1.1172
PR #808 by Naazimsnh02: 0.6364
PR #811 by quietsmile: 0.4377
PR #824 by sahiee-dev: 1.0896
PR #851 by RoyiRa: 0.2071
PR #868 by aamodbhatt: 0.1181
PR #901 by Hilo-Hilo: 1.1590
PR #912 by Bortlesboat: 0.3461
PR #931 by AnirudhRahul: 0.0498
PR #948 by dentity007: 0.1156
PR #962 by AnirudhRahul: 0.0214
PR #965 by Adam-Jacuch: 1.1184
PR #968 by dentity007: 0.1154
PR #1050 by Taleef7: 1.1194
PR #1128 by AnubhavBharadwaaj: 1.1154
PR #1174 by Okropniak: 1.3069
PR #1253 by Okropniak: 1.2326
PR #1300 by Ribin545: 1.8184
PR #1349 by LocalX991: 1.3693
PR #1451 by davie2009kh: 1.1180
PR #1452 by bsisduck: 0.3509
PR #1454 by bsisduck: 0.3509
PR #1478 by jxgod: 1.1995
PR #1547 by adityasasidhar: 1.1928
PR #1605 by renqianluo: 0.2988
PR #1630 by KevinChunye: 1.1412
PR #1655 by himanalot: 1.1135
PR #1793 by sunburnt716: 1.5782
PR #1816 by JiaJunDeng5930: 1.3915
PR #1857 by dexhunter: 1.0322
PR #1885 by leon2k2k2k: 0.9944
PR #2014 by simonbissonnette (RECORD): 1.0576
PR #2034 by Maheshram1: 1.0576
PR #2062 by BumaldaOverTheWater94: 1.2195
PR #2078 by hi-aduek: 1.0580

Hyperparameters Across PRs

pr_number  parameters
162        {"stride":64}
209        {"stride":64,"eval_seq_len":2048}
212        {"stride":64}
285        {"stride":0}
316        {"stride":1024}
348        {"stride":32}
393        {"stride":512}
410        {"stride":32}
415        {"stride":32}
436        {"stride":512}
483        {"stride":64}
526        {"stride":64}
530        {"EVAL_STRIDE":0,"description":"Standard evaluation, not sliding window, for fast iteration"}
533        {"stride":32}
554        {"stride":256}
588        {"stride":64,"eval_batch_seqs":256}
598        {"stride":64}
600        {"stride":512}
601        {"stride":128}
615        {"stride":64}
626        {"stride":64,"mode":"sliding"}
680        {"stride":64}
686        {"stride":64}
769        {"stride":2048,"seq_len":2048}
776        {"stride":64}
779        {"stride":64}
785        {"stride":64}
790        {"stride":64}
808        {"stride":64}
811        {"stride":128}
824        {"stride":64}
851        {"stride":64}
868        {"two_pass":true,"rescore_chunks":72,"order":12}
901        {"stride":64}
912        {"stride":64}
931        {"stride":64}
948        {"stride":64}
962        {"stride":64}
965        {"stride":64}
968        {"stride":64}
1050       {"stride":64}
1128       {"stride":64}
1174       {"stride":64}
1253       {"stride":64}
1300       {"stride":64}
1349       {"stride":64}
1451       {"chunk_size":256,"eval_seq_len":1024,"batch_size":64}
1452       {"stride":384}
1454       {"stride":384}
1478       {"stride":1024}
1547       {"chunk_size":256,"eval_seq_len":1024,"batch_size":64}
1605       {"stride":96}
1630       {"stride":64}
1655       {"stride":76}
1793       {"chunk_size":256,"eval_seq_len":1024,"batch_size":64}
1816       {"stride":512}
1857       {"stride":64}
1885       {"stride":2048}
2014       {"stride":1536,"context_length":3072}
2034       {"stride":1536,"context_length":3072}
2062       {"stride":64}
2078       {"stride":1536}
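The table above lists only parameter values, so the following is a minimal sketch of how a `stride` setting is commonly used in sliding-window evaluation. It is not taken from any of the listed PRs. The function names `sliding_windows` and `bits_per_byte` are hypothetical. Treating a stride of 0 as plain non-overlapping chunks is an assumption based on the PR #530 description ("Standard evaluation, not sliding window"). Each window sees up to `seq_len` tokens of context, but only the tokens not covered by the previous window are scored, so every token counts exactly once.

```python
import math
from typing import Iterator, Tuple


def sliding_windows(n_tokens: int, seq_len: int, stride: int) -> Iterator[Tuple[int, int, int]]:
    """Yield (begin, end, n_scored) for each evaluation window.

    Each window covers tokens [begin, end). Only its last n_scored tokens
    contribute to the loss; the earlier ones serve as context.
    """
    if stride <= 0:
        # Assumption: stride 0 means standard, non-overlapping evaluation.
        stride = seq_len
    if stride > seq_len:
        raise ValueError("stride larger than seq_len would skip tokens")
    prev_end = 0
    for begin in range(0, n_tokens, stride):
        end = min(begin + seq_len, n_tokens)
        yield begin, end, end - prev_end
        prev_end = end
        if end == n_tokens:
            break


def bits_per_byte(total_nll_nats: float, n_bytes: int) -> float:
    """Convert a summed negative log-likelihood (in nats) to bits per byte."""
    return total_nll_nats / (math.log(2) * n_bytes)


if __name__ == "__main__":
    # Toy sizes for illustration only.
    for begin, end, n_scored in sliding_windows(n_tokens=10, seq_len=4, stride=2):
        print(begin, end, n_scored)
```

A smaller stride gives each scored token more preceding context at the cost of more forward passes: with `seq_len=2048`, a stride of 64 runs roughly 32 times as many windows as a stride of 2048.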