-
Notifications
You must be signed in to change notification settings - Fork 435
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[6287717][ONNX][Quantization] Preserve trt.plugins custom-op value_info in clear_stale_value_info
#1697
opened Jun 12, 2026 by
gcunhase
Contributor
Loading…
Remove examples/diffusers/eval example
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1694
opened Jun 11, 2026 by
jingyu-ml
Contributor
Loading…
[nvbug 6289151] Fix exported Step layer type metadata
#1693
opened Jun 11, 2026 by
meenchen
Contributor
Loading…
Exclude multimodal vision branch from quantization by default (NVBug 6293731, 6293762)
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1691
opened Jun 11, 2026 by
Edwardf0t1
Contributor
Loading…
Fix gemma w4a8_awq recipe crashing export on multimodal checkpoints (NVBug 6294017)
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1690
opened Jun 11, 2026 by
Edwardf0t1
Contributor
Loading…
fastgen DMD2: make the Qwen-Image example self-contained on stock nemo_automodel
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1688
opened Jun 11, 2026 by
jingyu-ml
Contributor
Loading…
[OMNIML-4922] Four over Six PTQ & Updating Nemotron Ultra Example
#1684
opened Jun 11, 2026 by
jenchen13
Contributor
Loading…
1 task done
Fix GPT-OSS MXFP4->NVFP4 PTQ load, export, and cast (nvbug 6295279, 6295242)
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1678
opened Jun 11, 2026 by
cjluo-nv
Collaborator
Loading…
[OMNIML-4998] Per-expert weight quantization on TEGroupedLinear
#1671
opened Jun 10, 2026 by
hychiang-git
Contributor
•
Draft
support new logic of common state dict
cherry-pick-0.45.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1669
opened Jun 10, 2026 by
dimapihtar
Loading…
[6241485] Add support for ONNX Q/DQ node placement for DLA
#1661
opened Jun 9, 2026 by
gcunhase
Contributor
Loading…
Add NVFP4 + QAD to the Nemotron-3-Nano-30B-A3B tutorial
#1660
opened Jun 9, 2026 by
kevalmorabia97
Collaborator
•
Draft
Add fused Triton kernel for local-Hessian NVFP4 weight-scale search
#1659
opened Jun 9, 2026 by
Fridah-nv
Contributor
Loading…
docs(deployment skill): drop wrong "release predates arch" cu130 fallback
#1654
opened Jun 8, 2026 by
Edwardf0t1
Contributor
Loading…
docs(eval skill): drop arch-specific cu130 nightly tag (release images are multi-arch)
#1649
opened Jun 8, 2026 by
cjluo-nv
Collaborator
Loading…
4 of 5 tasks
[OMNIML-4944] peft: add lora_dtype field to PEFTAttributeConfig
#1646
opened Jun 8, 2026 by
hychiang-git
Contributor
•
Draft
Previous Next
ProTip!
no:milestone will show everything without a milestone.