PG Field Guide
Learn
Techniques
Emerging
PRs
← PR #1869
PR #1873→
PR #1871
open
val_bpb 0.85330 (3-seed mean) Leo
by newjordan
View on GitHub
val_bpb
0.8533
Architecture
Transformer
Optimizer
—
Artifact Size
14,779,396 bytes
Training Techniques
Novel Contributions
3-seed mean submission
Leo model variant