Fix sharding of quantized models with non-power-of-2 bits #3006
Proposed changes
Sharding 6-bit (and other non-power-of-2 bit) quantized models with certain input dimensions (like 1536) may fail because `input_dims *= 32 // bits` truncates the integer division. See: ml-explore/mlx-lm#771 (comment)

For 6-bit with packed dimension 288, `32 // 6` evaluates to 5, so the scaling gives 288 * 5 = 1440 instead of the correct 288 * 32 / 6 = 1536 (see the sketch below).
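A minimal sketch of the corrected arithmetic, assuming the goal is to recover the original input dimension from the packed (uint32) dimension; the standalone function name below is illustrative, not the actual mlx-lm code:

```python
def unpacked_input_dims(packed_dims: int, bits: int) -> int:
    # Buggy form: packed_dims * (32 // bits)
    #   bits=6 -> 32 // 6 == 5 -> 288 * 5 == 1440 (truncated)
    # Fixed form: multiply before the floor division so nothing is lost
    #   bits=6 -> 288 * 32 // 6 == 1536 (correct)
    return packed_dims * 32 // bits


assert unpacked_input_dims(288, 6) == 1536   # non-power-of-2 bits now round-trip
assert unpacked_input_dims(1024, 4) == 8192  # power-of-2 bits are unchanged
```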
The truncated dimension caused `shard_linear` to fail with: `ValueError: [quantize] ... matrix has shape (6144,1440)`

Checklist
Put an x in the boxes that apply.
- I have run `pre-commit run --all-files` to format my code / installed pre-commit prior to committing changes