Skip to content

Develop#151

Merged
solderzzc merged 3 commits intomasterfrom
develop
Mar 14, 2026
Merged

Develop#151
solderzzc merged 3 commits intomasterfrom
develop

Conversation

@solderzzc
Copy link
Member

No description provided.

solderzzc and others added 3 commits March 13, 2026 17:28
…parsing, token streaming

- Disable Qwen3.5 thinking via empty <think></think> assistant prefix injection
- Add balanced brace JSON parser to handle trailing thinking text
- Add buffered token streaming with [C]/[R] field tagging
- Add smart early abort: 100 reasoning tokens, 2x maxTokens, 2000 global cap
- Add full prompt logging with inline image support ([IMG:] protocol)
- Add Qwen3.5 recommended non-thinking params (presence_penalty 1.5)
- Remove unsupported response_format and chat_template_kwargs for llama-server
fix: benchmark Qwen3.5 compatibility — disable thinking, robust JSON …
@solderzzc solderzzc merged commit f367a41 into master Mar 14, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant