
mixed int6

Category: Quantization
Used in: 19 PRs
Best BPB: 0.0972
Avg BPB: 1.0655

Hyperparameters Across PRs

| pr_number | bits | scope |
|---|---|---|
| 135 | 6 | MLP and attention weight matrices; FP16 passthrough for tied embeddings and last 2 layers' Key projections |
| 174 | 6 | large MLP and attention matrices |
| 339 | 6 | model weights |
| 398 | 6 | all |
| 581 | 6 | model weights |
| 649 | 6 | all |
| 684 | 6 | model weights |
| 698 | 6 | all |
| 811 | 6 | model weights |
| 922 | 6 | model |
| 993 | 6 | post-training mixed |
| 1052 | 6 | artifact |
| 1427 | 6 | model weights |
| 1438 | 6 | mlp;attn;embed |
| 1465 | 6 | embeddings |
| 1569 | 6 | default export |
| 1664 | 6 | all |
| 1665 | 6 | MLP, attention, and Mamba projection weights |
| 1696 | 6 | attention/MLP banks |
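For readers unfamiliar with the technique, the idea behind int6 weight quantization can be sketched as symmetric rounding of a weight tensor into the signed 6-bit range [-31, 31] with a per-tensor scale. This is a minimal illustrative sketch, not the exact scheme used by any of the PRs above (which vary in scope and may use per-channel scales, asymmetric ranges, or FP16 passthrough for selected tensors):

```python
import numpy as np

def quantize_int6(w):
    """Symmetric per-tensor int6 quantization (illustrative sketch).

    Maps float weights onto integers in [-31, 31] using a single scale.
    """
    max_abs = np.abs(w).max()
    scale = max_abs / 31.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(w / scale), -31, 31).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover approximate float weights from the int6 codes.
    return q.astype(np.float32) * scale

# Round-trip a small random weight matrix.
rng = np.random.default_rng(0)
w = rng.standard_normal((4, 4)).astype(np.float32)
q, s = quantize_int6(w)
w_hat = dequantize(q, s)
# Worst-case rounding error is half a quantization step.
err = np.abs(w - w_hat).max()
```

The "mixed" aspect in the scope column refers to applying this only to selected tensors (e.g. MLP and attention weight matrices) while keeping sensitive tensors, such as embeddings or particular projections, in higher precision.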