docs: add Kimi K2, GLM-4.5V and fix Qwen3-VL link #2108

Closed
sbhavani wants to merge 1 commit into NVIDIA-NeMo:main from sbhavani:docs/add-kimi-k2-to-readme

Conversation

sbhavani (Contributor) commented Jan 28, 2026

What does this PR do?

Updates the supported models table in README.md to include recently added models and fix a broken link.

Changelog

  • Add Kimi K2 (1T MoE) to supported models table with pretrain recipe
  • Add GLM-4.5V (vision-language) to supported models table with checkpoint conversion and finetune recipe
  • Fix Qwen3-VL recipe link from qwen3vl.py to qwen3_vl.py
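
A broken recipe link like the one fixed above can be caught mechanically. The following is a minimal sketch, not part of this PR or of the repository's tooling: the function name `find_broken_relative_links` and the example paths are hypothetical, and it assumes README links are plain inline Markdown links whose relative targets resolve against the repository root.

```python
import re
from pathlib import Path

# Matches inline Markdown links of the form [text](target).
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")


def find_broken_relative_links(markdown_text, repo_root):
    """Return relative link targets that do not exist under repo_root.

    Hypothetical helper for illustration; external URLs, mail links,
    and in-page anchors are skipped rather than checked.
    """
    root = Path(repo_root)
    broken = []
    for target in LINK_RE.findall(markdown_text):
        if target.startswith(("http://", "https://", "mailto:", "#")):
            continue
        path = target.split("#", 1)[0]  # drop any trailing fragment
        if path and not (root / path).exists():
            broken.append(target)
    return broken
```

Calling `find_broken_relative_links(Path("README.md").read_text(), ".")` from a checkout root would list any relative targets that are missing on disk, which is the kind of mismatch a renamed file such as `qwen3vl.py` versus `qwen3_vl.py` produces.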

GitHub Actions CI

See the CI section in the Contributing doc for how to trigger the CI. An NVIDIA developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (e.g., Numba, Pynini, Apex)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

Summary by CodeRabbit

Release Notes

  • New Features

    • Added support for GLM-4.5V model.
    • Added support for Kimi K2 model for pretraining.
  • Documentation

    • Updated Qwen3-VL model recipe reference for accuracy.

coderabbitai (Bot) commented Jan 28, 2026

📝 Walkthrough

README.md was updated to add two new model entries (GLM-4.5V and Kimi K2) with their corresponding recipe links, and corrected the recipe reference for Qwen3-VL from qwen3vl.py to qwen3_vl.py.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| Documentation Updates: README.md | Added GLM-4.5V and Kimi K2 model rows with recipe links; corrected Qwen3-VL recipe reference filename |

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Suggested reviewers

  • yaoyu-33
  • ananthsub

🚥 Pre-merge checks: ✅ 4 passed

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately summarizes the main changes: adding two new models (Kimi K2 and GLM-4.5V) to documentation and fixing a broken Qwen3-VL link. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |
| Test Results For Major Changes | ✅ Passed | Pull request contains only minor documentation updates to README.md (+3/-1 lines) adding two new models and fixing a broken link. |

Signed-off-by: Santosh Bhavani <santosh.bhavani@live.com>
sbhavani force-pushed the docs/add-kimi-k2-to-readme branch from 8dcb7a8 to 54b3c86 on January 28, 2026 19:47
sbhavani requested a review from snowmanwwg on January 28, 2026 19:47
sbhavani closed this on Mar 31, 2026