← Back to Evaluation

neural cache

Evaluation
Used in
1 PRs
Best BPB
1.4245
Avg BPB
1.4245

Hyperparameters Across PRs

pr_numberparameters
304{"hidden_state_dim":512,"dtype":"bf16","interpolation":"logaddexp"}