← Back to Architecture

low-rank GRU state carry

Architecture
Used in
1 PRs
Best BPB
1.2271
Avg BPB
1.2271

Hyperparameters Across PRs

pr_numberparameters
298{"rank":16}