fix: benchmark Qwen3.5 compatibility — disable thinking, robust JSON … by solderzzc · Pull Request #150 · SharpAI/DeepCamera

solderzzc · 2026-03-14T00:28:43Z

…parsing, token streaming

Disable Qwen3.5 thinking via empty assistant prefix injection
Add balanced brace JSON parser to handle trailing thinking text
Add buffered token streaming with [C]/[R] field tagging
Add smart early abort: 100 reasoning tokens, 2x maxTokens, 2000 global cap
Add full prompt logging with inline image support ([IMG:] protocol)
Add Qwen3.5 recommended non-thinking params (presence_penalty 1.5)
Remove unsupported response_format and chat_template_kwargs for llama-server

…parsing, token streaming - Disable Qwen3.5 thinking via empty <think></think> assistant prefix injection - Add balanced brace JSON parser to handle trailing thinking text - Add buffered token streaming with [C]/[R] field tagging - Add smart early abort: 100 reasoning tokens, 2x maxTokens, 2000 global cap - Add full prompt logging with inline image support ([IMG:] protocol) - Add Qwen3.5 recommended non-thinking params (presence_penalty 1.5) - Remove unsupported response_format and chat_template_kwargs for llama-server

solderzzc merged commit bf4c517 into develop Mar 14, 2026
1 check passed

solderzzc deleted the feature/benchmark-qwen35-fixes branch March 14, 2026 00:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: benchmark Qwen3.5 compatibility — disable thinking, robust JSON …#150

fix: benchmark Qwen3.5 compatibility — disable thinking, robust JSON …#150
solderzzc merged 1 commit intodevelopfrom
feature/benchmark-qwen35-fixes

solderzzc commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

solderzzc commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant