← Back to Architecture
TrigramHash
ArchitectureUsed in
31 PRs
Best BPB
0.9850
Avg BPB
1.1658
Submissions
PR #292by xuafeng
1.3274PR #327by Ananddna
1.1450PR #344by aryanbhosale
1.1330PR #418by yashverms
1.1715PR #440by Ashutosh3142857
1.2219PR #486by ndokutovich
1.1101PR #562by bigbag
1.1354PR #571by maxwellcipher
1.2791PR #635by aryanbhosale
1.1330PR #882by IshiPareek
1.3762PR #884by BhatiaUday
1.1448PR #1089by mikeapedia
1.1086PR #1098by adityakm24
1.1187PR #1117by adityakm24
1.1187PR #1118by adityakm24
1.1187PR #1169by Bortlesboat
1.1126PR #1182by adityakm24
1.1227PR #1186by andrewbaggio1
0.9850PR #1200by Mister2005
1.6768PR #1201by Mister2005
1.6371PR #1311by htrung1105
1.1303PR #1370by Christopher-Lee-McClendon
1.0030PR #1384by iverbovoy
1.1441PR #1440by Mertyandimata
1.1026PR #1501by SPThole
1.1159PR #1544by Abhishek8108
1.0283PR #1545by Abhishek8108
1.0283PR #1553by Abhishek8108
1.2097PR #1602by SPThole
1.0744PR #1632by Hkoyuer
1.0274PR #1749by gracebml
1.0996Hyperparameters Across PRs
| pr_number | parameters |
|---|---|
| 292 | {"buckets":4096,"dim":32} |
| 327 | {"buckets":8192,"dim":64} |
| 344 | {"size":4096,"dim":128} |
| 418 | {"buckets":2048,"dimensions":64} |
| 440 | {"vocab_size":2048,"dim":48} |
| 486 | {"buckets":4096,"dim":128} |
| 562 | — |
| 571 | {"variants":[{"buckets":2048,"embed_dim":64},{"buckets":4096,"embed_dim":96}]} |
| 635 | {"buckets":4096,"dim":128} |
| 882 | {"buckets":8192,"n_gram":3} |
| 884 | {"vocab_size":2048,"trigram_dim":48,"project_dim":512} |
| 1089 | {"heads":2,"buckets":8192} |
| 1098 | {"size":1024} |
| 1117 | {"size":1024} |
| 1118 | {"dimensions":1024} |
| 1169 | {"heads":2,"buckets":8192} |
| 1182 | {"buckets":1024,"dimensions":128} |
| 1186 | {"order":3} |
| 1200 | {"vocab_size":4096,"dimensions":128} |
| 1201 | {"vocab_size":4096,"dimension":128} |
| 1311 | {"enabled":false} |
| 1370 | — |
| 1384 | {"buckets":65000} |
| 1440 | {"buckets":3072,"heads":2} |
| 1501 | — |
| 1544 | — |
| 1545 | — |
| 1553 | — |
| 1602 | — |
| 1632 | — |
| 1749 | — |