Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

[None][fix] ADP router crashes on serve when scheduling_params.attent…
#14267 opened May 18, 2026 by nv-guomingz Collaborator
1 task done
[TRTLLM-12719][cbts] Add core code related rule
#14266 opened May 18, 2026 by crazydemo Collaborator
1 task done
[None][feat] Pre-allocate multimodal encoder attention workspace api-compatible
#14264 opened May 18, 2026 by yechank-nvidia Collaborator Draft
[None][chore] Auto-update test durations from post-merge
#14262 opened May 18, 2026 by tensorrt-cicd Collaborator
[None][test] Waive 2 failed cases for main in QA CI
#14260 opened May 18, 2026 by xinhe-nv Collaborator
[None][infra] Waive 1 failed cases for main in post-merge
#14259 opened May 18, 2026 by xinhe-nv Collaborator
fix token_range_end add extra_kv_num_tokens
#14258 opened May 18, 2026 by chuangz0 Collaborator
1 task done
[None][test] Waive 7 failed cases for main in QA CI
#14257 opened May 18, 2026 by xinhe-nv Collaborator Draft
[None][feat] Add swiglu clamp to CuteDSL NVF4 MOE FC1 kernels
#14256 opened May 18, 2026 by liyuhannnnn Collaborator
1 task done
[None][fix] Handle None kv_cache in add_dummy_requests deepseek-v4
#14255 opened May 18, 2026 by Shixiaowei02 Collaborator
1 task done
[None][fix] DSv4 o_a_proj: decouple from use_cute_dsl_blockscaling_bmm
#14254 opened May 18, 2026 by lishicheng1996-nv Collaborator
1 task done
[None][test] Waive 1 failed cases for main in QA CI
#14253 opened May 18, 2026 by xinhe-nv Collaborator Draft
[https://nvbugs/6180247][fix] fix cache transceiver for dsv4 branch deepseek-v4
#14251 opened May 18, 2026 by chuangz0 Collaborator
1 task done
[None][perf] fuse quant into norm and rope deepseek-v4
#14250 opened May 18, 2026 by mingyangHao Collaborator Draft
1 task
[None][chore] Update flashinfer-python from 0.6.10 to 0.6.11
#14247 opened May 18, 2026 by lfr-0531 Collaborator
1 task done
[TRTLLM-12751][feat] visual-gen /metrics iteration stats producer
#14246 opened May 18, 2026 by JunyiXu-nv Collaborator
1 task done
[None][refactor] clean up AttentionForwardArgs
#14244 opened May 18, 2026 by yuxianq Collaborator
1 task done
[None][fix] Update JIT libs using upgraded NVCC version 13.2
#14242 opened May 18, 2026 by heyuhhh Collaborator
1 task
[None][perf] DSv4 mem-opts: RoPE cap + MHC pool routing (split from #14053)
#14241 opened May 18, 2026 by lancelly Collaborator Draft
3 of 4 tasks