Skip to content

Add Voxtral-Mini-4B-Realtime model support via vLLM backend#8956

Open
localai-bot wants to merge 1 commit intomudler:masterfrom
localai-bot:fix/issue-20260310_190622_task_8401-voxtral-mini-4b-realtime
Open

Add Voxtral-Mini-4B-Realtime model support via vLLM backend#8956
localai-bot wants to merge 1 commit intomudler:masterfrom
localai-bot:fix/issue-20260310_190622_task_8401-voxtral-mini-4b-realtime

Conversation

@localai-bot
Copy link
Contributor

Summary

This PR adds support for the Voxtral-Mini-4B-Realtime-2602 model from Mistral AI via the vLLM backend.

Changes

  • Added gallery definition file:
  • Updated gallery index to point to the new model configuration
  • Configured vLLM backend with recommended settings for real-time ASR

Model Details

  • Model: mistralai/Voxtral-Mini-4B-Realtime-2602
  • Type: Multilingual real-time speech transcription (ASR)
  • Latency: <500ms
  • Languages: 13 supported languages
  • Backend: vLLM with Realtime API support

Testing

The model can be used for real-time transcription workflows once this PR is merged. Users can install the model via the LocalAI gallery.

References

@netlify
Copy link

netlify bot commented Mar 11, 2026

Deploy Preview for localai ready!

Name Link
🔨 Latest commit da4c8d2
🔍 Latest deploy log https://app.netlify.com/projects/localai/deploys/69b16c52643fac00084b0fbf
😎 Deploy Preview https://deploy-preview-8956--localai.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

- Add gallery definition for Voxtral-Mini-4B-Realtime-2602 model
- Configure vLLM backend with recommended settings for real-time ASR
- Update gallery index to point to new model configuration
- Model supports multilingual transcription with <500ms latency
- Uses vLLM's Realtime API for streaming audio processing

References:
- https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602
- mudler#8401

Signed-off-by: team-coding-agent-1 <team-coding-agent-1@localai.dev>
@localai-bot localai-bot force-pushed the fix/issue-20260310_190622_task_8401-voxtral-mini-4b-realtime branch from 3fc6d53 to da4c8d2 Compare March 11, 2026 13:21
@localai-bot localai-bot moved this to In review in LocalAI Agent team Mar 11, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: In review

Development

Successfully merging this pull request may close these issues.

1 participant