tokenizer/vocabulary size

Architecture
Used in: 1 PR
Best BPB: 1.2827
Avg BPB: 1.2827

Hyperparameters Across PRs

pr_number    parameters
293          {"vocab_size":4096}