← Back to Initialization

SVD-based attention warm-start

Initialization
Used in
1 PRs
Best BPB
1.3525
Avg BPB
1.3525