entropy token masking

Regularization
Used in: 1 PR
Best BPB: 1.1490
Avg BPB: 1.1490

Hyperparameters Across PRs

pr_number    parameters
459