← Back to Architecture

memory tokens

Architecture
Used in
2 PRs
Best BPB
1.1466
Avg BPB
1.4994

Hyperparameters Across PRs

pr_numberparameters
345{"count":16}
421{"tokens":64}