int6
Quantization
Used in: 164 PRs
Best BPB: 0.0180
Avg BPB: 1.1038
Submissions

| PR | Author | BPB |
|---|---|---|
| #64 | yesbhautik | 1.1250 |
| #88 | seanward | 1.1605 |
| #102 | unnir | 1.1618 |
| #103 | MatthewHRockwell | 1.5000 |
| #110 | mr-ashish-panday | 1.2244 |
| #114 | saml212 | 1.1574 |
| #117 | trovatochris | 1.1702 |
| #128 | rsavitt | 1.1594 |
| #147 | ankitmaloo | 1.1631 |
| #156 | dexhunter | 1.1602 |
| #162 | raahilshah (RECORD) | 1.1458 |
| #173 | tamoghnokandar | 1.1532 |
| #178 | timowhite88 | 1.1667 |
| #179 | devin-cog | 1.1472 |
| #182 | mihir-s-05 | 1.1844 |
| #186 | mahsumaktas | 1.1565 |
| #191 | chris-buckley | 1.1598 |
| #201 | machdragon | 1.1551 |
| #204 | Akasxh | 1.2320 |
| #208 | ajkpersonal | 1.1568 |
| #209 | JWLBOYCE | 1.1624 |
| #212 | mrdavtan | 1.1329 |
| #215 | JayCheng113 | 1.1548 |
| #217 | kshitizz36 | 1.1753 |
| #218 | bopmite | 1.1248 |
| #230 | MatthewHRockwell | 1.1541 |
| #238 | kellyvv | 1.5164 |
| #243 | kvmukilan | 1.1704 |
| #246 | kvmukilan | 1.1704 |
| #249 | kvmukilan | 1.1704 |
| #251 | kshitizz36 | 1.1596 |
| #262 | ibarrajo | 1.0539 |
| #275 | ibarrajo | 1.0539 |
| #278 | nicolasdickenmann | 1.0365 |
| #289 | integrate-your-mind | 1.1518 |
| #290 | ibarrajo | 1.1354 |
| #294 | sseanliu | 1.1645 |
| #296 | sseanliu | 1.1645 |
| #303 | sseanliu | 1.1436 |
| #307 | dennisimoo | 1.1357 |
| #316 | SkywardSyntax | 1.2035 |
| #330 | bopmite | 1.1609 |
| #333 | mahsumaktas | 1.1565 |
| #344 | aryanbhosale | 1.1330 |
| #362 | mkenney2 | 1.1497 |
| #371 | mrdavtan | 1.1401 |
| #373 | JoeProAI | 1.1634 |
| #375 | charmquark1984 | 1.1257 |
| #384 | anantdgoel | 1.2882 |
| #394 | greqone | 1.1247 |
| #399 | abaybektursun | 1.1247 |
| #400 | chanwoo-park-official | 1.1296 |
| #416 | kshitizz36 | 1.1230 |
| #418 | yashverms | 1.1715 |
| #424 | someone114514 | 1.1725 |
| #429 | AbhisekBasu1 | 1.1231 |
| #432 | jadechip | 1.5295 |
| #442 | sjp611 | 1.1027 |
| #448 | handemanai | 1.2006 |
| #452 | ofirkris | 1.1366 |
| #462 | JoeProAI | 1.0672 |
| #465 | LoquiAuris | 1.1508 |
| #465 | LoquiAuris | 1.1508 |
| #481 | mrdavtan | 1.0970 |
| #485 | harsha-gouru | 1.1522 |
| #493 | parinzee | 1.1309 |
| #512 | MatoTeziTanka | 0.9512 |
| #517 | lukacf | 0.9789 |
| #526 | Christopher-Lee-McClendon | 1.1425 |
| #548 | LoquiAuris | 1.0865 |
| #567 | nitSubedi | 1.3660 |
| #568 | MatoTeziTanka | 0.7853 |
| #596 | AriaAnima | 0.6430 |
| #599 | mkenney2 | 1.1828 |
| #605 | bigbag | 0.7227 |
| #614 | bigbag | 0.6864 |
| #646 | Upsalla | 1.1349 |
| #661 | andrewbaggio1 | 1.1175 |
| #668 | Christopher-Lee-McClendon | 1.0920 |
| #671 | keshav55 | 1.1807 |
| #672 | andrewbaggio1 | 1.0781 |
| #685 | andrewbaggio1 | 1.0366 |
| #686 | msisovic | 1.1182 |
| #696 | gravelBridge | 1.2622 |
| #705 | seanward | 1.2151 |
| #715 | Asukabot0 | 1.0337 |
| #722 | magicjulio | 0.5588 |
| #727 | Asukabot0 | 0.9674 |
| #741 | andrewbaggio1 | 0.9850 |
| #759 | markste-in | 1.3092 |
| #767 | RichiiiTV | 0.9209 |
| #769 | MatoTeziTanka | 0.8508 |
| #770 | minh-stakc | 0.6672 |
| #773 | siddhantparadox | 1.1532 |
| #776 | agalimova | 0.9258 |
| #782 | newjordan | 0.9362 |
| #793 | pall23-mech | 1.2500 |
| #798 | travispchen | 0.5466 |
| #831 | sseanliu | 1.1284 |
| #841 | someone114514 | 1.1157 |
| #857 | aruniyer | 1.1093 |
| #883 | THUQiXuan | 0.0308 |
| #886 | abaybektursun | 0.3779 |
| #891 | robbiebusinessacc | 1.1428 |
| #892 | robbiebusinessacc | 1.1428 |
| #901 | Hilo-Hilo | 1.1590 |
| #907 | resouer | 0.0960 |
| #909 | sunnypatneedi | 0.8609 |
| #940 | antaloaalonso | 0.9581 |
| #978 | AnirudhRahul | 1.5134 |
| #990 | newjordan | 0.7614 |
| #997 | randy06122001-boop | 1.4182 |
| #998 | asuramaya | 0.5755 |
| #1007 | dillon-blake | 1.2252 |
| #1014 | haimianbaobao007 | 1.6200 |
| #1028 | newjordan | 0.8104 |
| #1030 | sofiabod | 0.1130 |
| #1044 | greqone | 1.8989 |
| #1048 | mrdavtan | 1.1724 |
| #1055 | sanyalsunny111 | 0.9693 |
| #1056 | sofiabod | 0.0180 |
| #1071 | AbhayAnandUCSD | 1.1455 |
| #1081 | michaelwinczuk | 1.1220 |
| #1108 | DbBested | 1.1502 |
| #1112 | dillon-blake | 1.2252 |
| #1140 | newjordan | 1.1874 |
| #1174 | Okropniak | 1.3069 |
| #1180 | estesryan | 1.0577 |
| #1183 | akaiHuang | 1.5080 |
| #1185 | skoustav35 | 0.9641 |
| #1186 | andrewbaggio1 | 0.9850 |
| #1214 | gersh | 1.1688 |
| #1227 | himanshudongre | 1.4841 |
| #1232 | Christopher-Lee-McClendon | 1.0929 |
| #1242 | Campbellb | 1.0903 |
| #1243 | simon-marcus | 1.1230 |
| #1244 | monkeyKingProgrammer | 1.1443 |
| #1253 | Okropniak | 1.2326 |
| #1255 | akaiHuang | 1.5080 |
| #1282 | newjordan | 1.1035 |
| #1307 | amrayach | 1.1101 |
| #1320 | jpfeiffe | 1.1196 |
| #1330 | luciobaiocchi | 1.4617 |
| #1331 | dexhunter | 1.0900 |
| #1349 | LocalX991 | 1.3693 |
| #1354 | samacqua | 1.1092 |
| #1414 | Abhishek8108 | 0.7093 |
| #1418 | Park-Tae-Hwan | 1.4192 |
| #1447 | shram86 | 1.1834 |
| #1463 | tsubasagit | 1.2774 |
| #1473 | AVINASH0052 | 1.1156 |
| #1476 | aryan-cs | 1.0842 |
| #1518 | abaybektursun | 1.0788 |
| #1531 | mini-sarami | 1.4537 |
| #1534 | someone114514 | 1.0846 |
| #1602 | SPThole | 1.0744 |
| #1612 | seekerPrice | 1.5096 |
| #1654 | IshiPareek | 1.2699 |
| #1663 | pablinga19 | 1.0862 |
| #1724 | Unwindology | 1.1803 |
| #1732 | Victory963 | 1.0785 |
| #1733 | G3sparky | 1.3262 |
| #1740 | amrayach | 1.0722 |
| #1741 | amrayach | 1.0722 |

Hyperparameters Across PRs

| pr_number | bits | scope |
|---|---|---|
| 64 | 6 | mlp, attn, tok_emb |
| 88 | 6 | all large 2D weight matrices |
| 102 | 6 | MLP and attention weight matrices |
| 103 | 6 | block weights with fp16 embedding and fp16 LoRA passthrough |
| 110 | 6 | large 2D matrices; fp16 for tied embedding |
| 114 | 6 | weight matrices |
| 117 | 6 | per-row weights |
| 128 | 6 | MLP and attention weights; tied embeddings kept fp16 |
| 147 | 6 | all |
| 156 | 6 | per-row weights; embeddings kept fp16 |
| 162 | 6 | MLP and attention weights; fp16 passthrough for tied embeddings and last-layer key projection |
| 173 | 6 | weight matrices with per-row scaling; tied embedding and last 2 layers' c_k.weight kept in fp16 |
| 178 | 6 | all |
| 179 | 6 | MLP and attention weights; embeddings kept in fp16 |
| 182 | 6 | middle layers |
| 186 | 6 | per-row weights |
| 191 | 6 | all large weight matrices |
| 201 | 6 | MLP and attention weights; int8 embeddings |
| 204 | 6 | all model weights |
| 208 | 6 | artifact/model weights |
| 209 | 6 | weight bits for model weights; embeddings kept at 16 bits |
| 212 | 6 | all weights |
| 215 | 6 | MLP and attention weights |
| 217 | 6 | all |
| 218 | 6 | all |
| 230 | 6 | per-row weights; tied embeddings kept in fp16 |
| 238 | 6 | all |
| 243 | 6 | all |
| 246 | 6 | all |
| 249 | 6 | all |
| 251 | 6 | all except fp16 embeddings |
| 262 | 6 | all |
| 275 | 6 | model weights |
| 278 | 6 | model weights |
| 289 | 6 | MLP and attention weights |
| 290 | 6 | all |
| 294 | 6 | model weights |
| 296 | 6 | all |
| 303 | 6 | all |
| 307 | 6 | all |
| 316 | 6 | all |
| 330 | 6 | all weights per-row |
| 333 | 6 | per-row weights |
| 344 | 6 | per-row weights |
| 362 | 6 | all |
| 371 | 6 | all |
| 373 | 6 | all |
| 375 | 6 | all |
| 384 | 6 | all |
| 394 | 6 | model artifact |
| 399 | 6 | evaluation artifact / model weights |
| 400 | 6 | mlp, attn |
| 416 | 6 | all |
| 418 | 6 | MLP and attention weight matrices |
| 424 | 6 | baseline model weights |
| 429 | 6 | all |
| 432 | 6 | MLP-only export / model weights with targeted fp16 exceptions |
| 442 | 6 | mixed |
| 448 | 6 | all weights with fp16 embedding passthrough |
| 452 | 6 | attention |
| 462 | 6 | all |
| 465 | 6 | attention |
| 465 | 6 | embeddings |
| 481 | 6 | per-row all weights |
| 485 | 6 | attention weights |
| 493 | 6 | all large weight matrices |
| 512 | 6 | all weight matrices |
| 517 | 6 | all |
| 526 | 6 | all |
| 548 | 6 | MLP and attention weights |
| 567 | 6 | — |
| 568 | 6 | all weight matrices |
| 596 | 6 | all |
| 599 | 6 | all |
| 605 | 6 | all weights with FP16 passthrough for embeddings and control tensors |
| 614 | 6 | all |
| 646 | 6 | — |
| 661 | 6 | all |
| 668 | 6 | per-row, including embeddings |
| 671 | 6 | attention weights |
| 672 | 6 | model weights |
| 685 | 6 | all |
| 686 | 6 | all |
| 696 | 6 | all weights |
| 705 | 6 | all |
| 715 | 6 | all |
| 722 | 6 | all |
| 727 | 6 | per-row weights |
| 741 | 6 | all |
| 759 | 6 | MLP |
| 767 | 6 | all |
| 769 | 6 | all |
| 770 | 6 | per-row |
| 773 | 6 | model weights |
| 776 | 6 | all |
| 782 | 6 | model weights |
| 793 | 6 | all |
| 798 | 6 | all |
| 831 | 6 | per-row weights |
| 841 | 6 | final artifact export |
| 857 | 6 | all |
| 883 | 6 | final artifact |
| 886 | 6 | all |
| 891 | 6 | MLP weights |
| 892 | 6 | MLP weights |
| 901 | 6 | model |
| 907 | 6 | all |
| 909 | 6 | all |
| 940 | 6 | per-row |
| 978 | 6 | all |
| 990 | 6 | all |
| 997 | 6 | block weights |
| 998 | 6 | artifact |
| 1007 | 6 | all |
| 1014 | 6 | all |
| 1028 | 6 | model weights |
| 1030 | 6 | per-row |
| 1044 | 6 | all |
| 1048 | 6 | all weights |
| 1055 | 6 | per-row weights |
| 1056 | 6 | per-row |
| 1071 | 6 | per-row weights |
| 1081 | 6 | all |
| 1108 | 6 | all |
| 1112 | 6 | all |
| 1140 | 6 | final artifact |
| 1174 | 6 | all |
| 1180 | 6 | all |
| 1183 | 6 | all |
| 1185 | 6 | per-row |
| 1186 | 6 | all |
| 1214 | 6 | artifact weights |
| 1227 | 6 | all |
| 1232 | 6 | all |
| 1242 | 6 | all |
| 1243 | 6 | attn, mlp, embed, other floating tensors |
| 1244 | 6 | all |
| 1253 | 6 | all |
| 1255 | 6 | all |
| 1282 | 6 | naive |
| 1307 | 6 | per-row export |
| 1320 | 6 | per-row |
| 1330 | 6 | all |
| 1331 | 6 | all |
| 1349 | 6 | all |
| 1354 | 6 | model |
| 1414 | 6 | all |
| 1418 | 6 | model weights |
| 1447 | 6 | AWQ |
| 1463 | 6 | weights |
| 1473 | 6 | all |
| 1476 | 6 | artifact |
| 1518 | 6 | model |
| 1531 | 6 | all |
| 1534 | 6 | all |
| 1602 | 6 | MLP |
| 1612 | 6 | model artifact |
| 1654 | 6 | all |
| 1663 | 6 | sliding eval artifact |
| 1724 | 6 | all |
| 1732 | 6 | MLP FC1 |
| 1733 | 6 | attention |
| 1740 | 6 | all |
| 1741 | 6 | model |
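
Many rows above give the scope as "per-row weights" at 6 bits. As a rough illustration of what that can mean, here is a minimal sketch of symmetric per-row int6 quantization in pure Python. The function names (`quantize_rows_int6`, `dequantize_rows`), the symmetric clipping range of [-31, 31], and the max-abs scale rule are all assumptions made for this example; none of them is taken from any listed PR, and individual submissions may pack, scale, or clip differently.

```python
# Hypothetical sketch: symmetric per-row int6 quantization.
# Names, the [-31, 31] range, and the max-abs scale are illustrative assumptions.

def quantize_rows_int6(weights):
    """Map each row to integers in [-31, 31] plus one float scale per row."""
    qmax = 31  # symmetric subset of the signed 6-bit range [-32, 31]
    quantized, scales = [], []
    for row in weights:
        max_abs = max(abs(w) for w in row)
        # An all-zero row gets scale 1.0 so the division below is safe.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        q_row = [max(-qmax, min(qmax, round(w / scale))) for w in row]
        quantized.append(q_row)
        scales.append(scale)
    return quantized, scales


def dequantize_rows(quantized, scales):
    """Recover approximate float weights from integer codes and row scales."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]


weights = [[0.5, -1.0, 0.25], [0.0, 0.0, 0.0], [3.1, -0.1, 1.55]]
q, s = quantize_rows_int6(weights)
restored = dequantize_rows(q, s)
```

With one scale per row, the round-trip error of each weight is bounded by about half that row's scale, so a row with small values is not forced onto the coarse grid of a row with large ones. Scopes such as "embeddings kept fp16" would correspond to skipping this step for the named tensors.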