← Back to Architecture

JEPA-style regression transformer

Architecture
Used in
1 PRs
Best BPB
1.8658
Avg BPB
1.8658

Hyperparameters Across PRs

pr_numberparameters
1513