spiking MLP

Architecture
Used in: 1 PR
Best BPB: 1.2982
Avg BPB: 1.2982

Hyperparameters Across PRs

pr_number    parameters
664          {"layers":9,"width":512,"attention_heads":8,"kv_heads":4,"sequence_length":1024,"snn_steps":2}
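
As a rough aid for reading the PR 664 row, the sketch below parses its parameter string and derives two quantities from it. It rests on assumptions the page does not confirm: that "width" is the model dimension split evenly across "attention_heads", and that having fewer "kv_heads" than "attention_heads" means key/value heads are shared between query heads. The names head_dim and queries_per_kv_head are illustrative and do not come from the page.

```python
import json

# Hyperparameter string copied verbatim from the PR 664 row.
raw = '{"layers":9,"width":512,"attention_heads":8,"kv_heads":4,"sequence_length":1024,"snn_steps":2}'
params = json.loads(raw)

# Assumption: "width" is the model dimension, divided evenly among attention heads.
head_dim = params["width"] // params["attention_heads"]

# Assumption: kv_heads < attention_heads means several query heads share one key/value head.
queries_per_kv_head = params["attention_heads"] // params["kv_heads"]

print("head_dim:", head_dim)
print("queries_per_kv_head:", queries_per_kv_head)
print("snn_steps:", params["snn_steps"])
```

The page does not say how "snn_steps" is used in the spiking MLP, so the sketch only reads the value back.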