← Back to Regularization

magnitude pruning

Regularization
Used in
64 PRs
Best BPB
0.0235
Avg BPB
1.0473

Submissions

PR #180by thwu1RECORD
1.1428
PR #267by andrewgcodes
1.1374
PR #292by xuafeng
1.3274
PR #295by gowtham0992
1.1477
PR #324by crony-io
1.1702
PR #326by crony-io
1.2890
PR #333by mahsumaktas
1.1565
PR #348by EthanYangTW
1.1444
PR #349by Mapika
1.1399
PR #366by shivnarainms22
1.1574
PR #434by parinzee
1.1370
PR #443by CREVIOS
1.1431
PR #447by CREVIOS
1.1431
PR #515by keshav55
1.1807
PR #544by EthanYangTW
1.1179
PR #563by instax-dutta
1.1428
PR #633by MatoTeziTanka
1.1526
PR #700by RoyiRa
1.0541
PR #759by markste-in
1.3092
PR #807by connectwithprakash
1.0116
PR #871by greqone
0.8004
PR #872by gowtham0992
1.0467
PR #880by RoyiRa
0.1003
PR #921by TimPietrusky
0.0939
PR #931by AnirudhRahul
0.0498
PR #945by TimPietrusky
0.0274
PR #960by ADIITJ
1.1882
PR #989by alexanderaperry-arch
1.1402
PR #1004by ibarrajo
1.1182
PR #1032by wfproc
1.3631
PR #1037by TimPietruskyRunPod
1.1179
PR #1048by mrdavtan
1.1724
PR #1058by resouer
1.1247
PR #1072by vimeto
1.1170
PR #1077by malc3om
1.1130
PR #1108by DbBested
1.1502
PR #1111by MichaelMcCulloch
0.2532
PR #1114by minh-stakc
0.0235
PR #1126by AnirudhRahul
1.1091
PR #1144by inFaaa
1.3572
PR #1171by EthanYangTW
1.1145
PR #1215by turbo-indubitable
1.1601
PR #1226by Wolfie8935
1.1428
PR #1234by ibarrajo
1.1461
PR #1236by ibarrajo
1.1179
PR #1237by ibarrajo
1.1198
PR #1248by ibarrajo
1.1264
PR #1284by tyrel-beede
1.1207
PR #1290by aryanbhosale
1.1104
PR #1305by DariusFeher
1.2070
PR #1345by shasank0001
1.1763
PR #1346by shasank0001
1.2283
PR #1347by shasank0001
1.3038
PR #1357by mollahasani
1.2200
PR #1369by xiayicheng3-code
1.1196
PR #1395by dttdrv
1.0924
PR #1401by teerthsharma
1.1100
PR #1421by X-Abhishek-X
1.0925
PR #1461by viasky657
0.4118
PR #1535by newjordan
1.0742
PR #1537by pireylow
1.3971
PR #1537by pireylow
1.3971
PR #1602by SPThole
1.0744
PR #1753by Abhishek-Dalvi410
1.2917

Hyperparameters Across PRs

pr_numberparameters
180{"sparsity":0.03}
267{"sparsity":"3%"}
292
295{"prune_frac":0.08}
324{"prune_fraction":0.05}
326{"pruned_fraction":0.05}
333{"sparsity":0.02}
348{"sparsity":0.05}
349{"pruning_ratio":0.08}
366{"sparsity":0.03}
434{"pruning_rate":0.08}
443{"prune_frac":0.03}
447{"fraction":0.03}
515{"pruning_amount":"3%"}
544{"pct":0.02}
563{"prune_percent":3}
633{"percentage":3}
700{"sparsity":0.03}
759{"sparsity":"1%"}
807{"sparsity":0.03}
871{"sparsity":"3%"}
872{"sparsity":0.03,"timing":"post-quant"}
880{"pct":0.05}
921{"prune_rate":0.05}
931{"prune_pct":0.05}
945{"pruning":"3%"}
960{"sparsity":0.03}
989{"pct":10}
1004{"pruning_rate":0.05}
1032{"fraction":0.05}
1037{"sparsity":0.04}
1048{"sparsity":0.03}
1058{"mode":"selective_prune_pre_quant","enabled":true}
1072{"type":"selective ±1 pruning"}
1077{"prune_fraction":0.03}
1108{"sparsity":0.078}
1111{"sparsity":"2%"}
1114{"sparsity":"3%"}
1126{"threshold":"|q| <= 2"}
1144{"sparsity":0.1}
1171{"sparsity":"3%"}
1215{"prune_pct":0.1,"scope":"2D tensors > 65536 params"}
1226{"sparsity":"3%"}
1234{"sparsity":0.1}
1236{"sparsity":"10%"}
1237{"sparsity":0.1}
1248{"selective":true}
1284{"values":"±1","selective":true}
1290{"type":"selective +/-1","criterion":"reconstruction error"}
1305{"prune_pct":0.03}
1345{"pct":0.033}
1346{"pct":0.032}
1347{"pct":0.032}
1357{"sparsity":"15-22%","schedule":"cubic"}
1369{"selective":true}
1395{"factor":4}
1401{"magnitude_prune":"4%"}
1421{"type":"selective pruning","values_pruned":290000}
1461{"threshold":1}
1535
1537{"pattern":"2:4 structured sparsity","scope":"MLP weight matrices"}
1537{"pattern":"2:4 structured sparsity","scope":"MLP weight matrices","importance":"Hessian-guided"}
1602{"sparsity":0.05}
1753{"binary_masks":true,"learned_mask_scores":true}