[Ascend] support qwen3.5 35BA3B by wanfengcxz · Pull Request #4485 · InternLM/lmdeploy

wanfengcxz · 2026-04-01T12:50:23Z

Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily receiving feedbacks. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers.

Motivation

Please describe the motivation of this PR and the goal you want to achieve through this PR.

Modification

Please briefly describe what modification is made in this PR.

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here, and update the documentation.

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
The documentation has been modified accordingly, like docstring or example tutorials.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

Adds Ascend/DLinfer support plumbing needed to run Qwen3.5 35B A3B (notably the linear-attention / gated-delta path) by propagating device_type, extending attention metadata, and initializing Ascend-specific Triton device properties.

Changes:

Propagate device_type into HF→ModelConfig build path and use it to pick bf16/fp16 state dtypes for Qwen3.5 configs.
Extend DLInfer attention metadata and populate Ascend attention metadata with GDN-related sequence info.
Add Ascend backend initialization for Triton device properties with fallback warnings when triton-ascend is missing.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
lmdeploy/pytorch/configurations/qwen3_5.py	Use `device_type` when selecting bf16 vs fp16 for Qwen3.5 state caches.
lmdeploy/pytorch/config.py	Thread `device_type` through `ModelConfig.from_hf_config` into the model config builder.
lmdeploy/pytorch/backends/dlinfer/attention.py	Add `has_initial_state` field to DLInfer attention metadata.
lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py	Populate GDN sequence metadata and initialize Triton device properties for Ascend backend.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py

lmdeploy/pytorch/backends/dlinfer/attention.py

lmdeploy/pytorch/configurations/qwen3_5.py

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

grimoire

LGTM

wanfengcxz and others added 5 commits April 1, 2026 07:17

[Ascend] support qwen3.5

8f92c4d

fix: update import path for init_device_properties_triton

7950c13

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

[ascend] fix missing device_type

aed7e7a

[ascend] refactor code

446f859

[ascend] add comment

d129b14

jinminxi104 self-requested a review April 7, 2026 03:40

wanfengcxz and others added 3 commits April 7, 2026 04:48

[ascend] add comment

e5db153

Merge branch 'main' into wq/qwen35

e095d6b

Fix syntax error in device_type assignment

9c3aae7

jinminxi104 approved these changes Apr 7, 2026

View reviewed changes

jinminxi104 marked this pull request as ready for review April 7, 2026 09:46

Copilot AI review requested due to automatic review settings April 7, 2026 09:46

Copilot started reviewing on behalf of jinminxi104 April 7, 2026 09:47 View session

Copilot AI reviewed Apr 7, 2026

View reviewed changes

wanfengcxz and others added 2 commits April 7, 2026 18:59

Apply suggestion from @Copilot

c489d43

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Apply suggestion from @Copilot

3f49521

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

jinminxi104 requested review from grimoire and lvhan028 April 7, 2026 14:18

grimoire approved these changes Apr 8, 2026

View reviewed changes

lvhan028 added the enhancement New feature or request label Apr 8, 2026

lvhan028 merged commit 687385e into InternLM:main Apr 8, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Ascend] support qwen3.5 35BA3B#4485

[Ascend] support qwen3.5 35BA3B#4485
lvhan028 merged 10 commits intoInternLM:mainfrom
wanfengcxz:wq/qwen35

wanfengcxz commented Apr 1, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

grimoire left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

wanfengcxz commented Apr 1, 2026

Motivation

Modification

BC-breaking (Optional)

Use cases (Optional)

Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

grimoire left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants