INL BetaMu attention

Architecture
Used in: 1 PR
Best BPB: 1.4072
Avg BPB: 1.4072

Hyperparameters Across PRs

pr_number  parameters
377        {"layers":[5,6,7,8]}