Skip to content

fix(flux.2-klein-9b): Use Qwen3-8b to avoid GGML assertion failure on tensor mismatch#8995

Merged
mudler merged 1 commit intomudler:masterfrom
richiejp:fix/flux2-klein
Mar 13, 2026
Merged

fix(flux.2-klein-9b): Use Qwen3-8b to avoid GGML assertion failure on tensor mismatch#8995
mudler merged 1 commit intomudler:masterfrom
richiejp:fix/flux2-klein

Conversation

@richiejp
Copy link
Collaborator

Avoid incomprehensible messages about not being able to matrix multiply from GGML. Which is because the output layer from Qwen3 4b has the wrong dimensions for the bigger Flux.2 klien model.

… tensor mismatch

Signed-off-by: Richard Palethorpe <io@richiejp.com>
@netlify
Copy link

netlify bot commented Mar 13, 2026

Deploy Preview for localai ready!

Name Link
🔨 Latest commit 30b0992
🔍 Latest deploy log https://app.netlify.com/projects/localai/deploys/69b42cc02b900a0008a2c53c
😎 Deploy Preview https://deploy-preview-8995--localai.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@mudler mudler merged commit 87b3e10 into mudler:master Mar 13, 2026
45 of 46 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants