← Back to Architecture

YaRN positional encoding

Architecture
Used in
1 PRs
Best BPB
1.1239
Avg BPB
1.1239

Hyperparameters Across PRs

pr_numberparameters
641{"max_len":2048,"rope_base":5000}