← Back to Regularization

freeze early layers

Regularization
Used in
1 PRs
Best BPB
1.1425
Avg BPB
1.1425

Hyperparameters Across PRs

pr_numberparameters
526{"frozen_blocks":2}