structured pruning

Regularization
Used in: 2 PRs
Best BPB: 1.1147
Avg BPB: 1.1673

Hyperparameters Across PRs

pr_number    parameters
1019         {"type":"±1 by reconstruction error"}
1551         {"top_k":true,"gradual_budget_annealing":true}