support new logic of common state dict#1669
Conversation
Signed-off-by: dimapihtar <dpykhtar@nvidia.com>
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (1)
📝 WalkthroughWalkthroughThis PR adds backward compatibility to ChangesBackward-compatible modelopt checkpoint restoration
Estimated code review effort🎯 2 (Simple) | ⏱️ ~8 minutes 🚥 Pre-merge checks | ✅ 6✅ Passed checks (6 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #1669 +/- ##
==========================================
- Coverage 77.31% 77.30% -0.01%
==========================================
Files 509 509
Lines 55912 55915 +3
==========================================
- Hits 43226 43225 -1
- Misses 12686 12690 +4
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Harness. 🚀 New features to boost your workflow:
|
|
@dimapihtar can you fix the code quality issues: https://github.com/NVIDIA/Model-Optimizer/actions/runs/27283492786/job/80584183938?pr=1669 |
What does this PR do?
Adds support for new logic of common state dict in MCore. MCore's PR: NVIDIA/Megatron-LM#5160
Usage
# Add a code snippet demonstrating how to use thisTesting
Before your PR is "Ready for review"
Make sure you read and follow Contributor guidelines and your commits are signed (
git commit -s -S).Make sure you read and follow the Security Best Practices (e.g. avoiding hardcoded
trust_remote_code=True,torch.load(..., weights_only=False),pickle, etc.).CONTRIBUTING.md: ✅ / ❌ / N/AAdditional Information
Summary by CodeRabbit
Bug Fixes