Develop by solderzzc · Pull Request #151 · SharpAI/DeepCamera

solderzzc · 2026-03-14T00:29:08Z

No description provided.

…parsing, token streaming - Disable Qwen3.5 thinking via empty <think></think> assistant prefix injection - Add balanced brace JSON parser to handle trailing thinking text - Add buffered token streaming with [C]/[R] field tagging - Add smart early abort: 100 reasoning tokens, 2x maxTokens, 2000 global cap - Add full prompt logging with inline image support ([IMG:] protocol) - Add Qwen3.5 recommended non-thinking params (presence_penalty 1.5) - Remove unsupported response_format and chat_template_kwargs for llama-server

fix: benchmark Qwen3.5 compatibility — disable thinking, robust JSON …

solderzzc and others added 3 commits March 13, 2026 17:28

Merge pull request #150 from SharpAI/feature/benchmark-qwen35-fixes

bf4c517

fix: benchmark Qwen3.5 compatibility — disable thinking, robust JSON …

Merge branch 'master' into develop

62b4e28

solderzzc merged commit f367a41 into master Mar 14, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Develop#151

Develop#151
solderzzc merged 3 commits intomasterfrom
develop

solderzzc commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

solderzzc commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant