PR #1871

open

val_bpb 0.85330 (3-seed mean) Leo

by newjordanView on GitHub
val_bpb
0.8533
Architecture
Transformer
Optimizer
Artifact Size
14,779,396 bytes

Training Techniques

Novel Contributions

  • 3-seed mean submission
  • Leo model variant