PR #1912

open

Improve contest submission

by WesSwogger
val_bpb: 1.3163
Architecture
Optimizer
Artifact Size: 15,154,431 bytes

Training Techniques

Novel Contributions

  • Improved export/submission handling in train_gpt.py
  • Updated .gitignore to avoid committing local model artifacts and logs
  • Verified a legal 1xH100 submission (15,154,431-byte artifact) under the 16 MB contest cap
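
The size check behind the last bullet can be sketched as follows. This is a minimal illustration, not the code in train_gpt.py: the helper names `headroom` and `check_artifact` are hypothetical, and the cap is assumed to mean 16,000,000 bytes (decimal megabytes), which is the stricter reading of "16 MB". The reported artifact size also fits under 16 MiB (16,777,216 bytes).

```python
import os

# Assumption: "16 MB" is read as decimal megabytes (the stricter interpretation).
CAP_BYTES = 16_000_000


def headroom(size_bytes: int, cap_bytes: int = CAP_BYTES) -> int:
    """Return the bytes remaining under the cap; a negative value means over the cap."""
    return cap_bytes - size_bytes


def check_artifact(path: str, cap_bytes: int = CAP_BYTES) -> int:
    """Measure a file on disk and raise ValueError if it exceeds the cap.

    Hypothetical helper: the path and the error message are illustrative only.
    """
    size = os.path.getsize(path)
    left = headroom(size, cap_bytes)
    if left < 0:
        raise ValueError(f"{path} is {size:,} bytes, {-left:,} bytes over the cap")
    return left


# Artifact size reported in this PR.
reported_size = 15_154_431
print(f"headroom: {headroom(reported_size):,} bytes")  # headroom: 845,569 bytes
```

Under that assumption the submission leaves 845,569 bytes of headroom, roughly 5% of the cap.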