
Fixes non-catching weight init regexes as torch.compile changes the FQNs#437

Merged
le1nux merged 1 commit into main from fix_compile_weight_init_bug on Mar 7, 2026

Conversation


@le1nux (Member) commented Mar 7, 2026

What does this PR do?

torch.compile changes the FQNs (fully qualified names) of parameters by prepending "_orig_mod." to the original FQN. This causes the regexes used to match parameter names to fail. The fix is to strip the "_orig_mod." prefix from parameter names before matching them against the regexes. This change applies to both llama3_like_initialization.py and initialization_routines.py, wherever parameter names are matched against regexes.
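The fix can be sketched as follows. This is an illustrative reconstruction, not the PR's actual code: the helper names (`strip_compile_prefix`, `matches_init_regex`) and the example regex are hypothetical, but the core idea, removing the "_orig_mod." prefix that torch.compile adds before matching, is the one described above.

```python
import re

# torch.compile wraps a module in an OptimizedModule whose parameters are
# reachable via an "_orig_mod" attribute, so every parameter FQN gains an
# "_orig_mod." prefix (possibly more than once with nested compilation).
COMPILE_PREFIX = "_orig_mod."


def strip_compile_prefix(fqn: str) -> str:
    """Remove all "_orig_mod." segments that torch.compile adds to a parameter FQN."""
    return fqn.replace(COMPILE_PREFIX, "")


def matches_init_regex(fqn: str, pattern: str) -> bool:
    """Match a parameter FQN against a weight-init regex, ignoring the compile prefix."""
    return re.fullmatch(pattern, strip_compile_prefix(fqn)) is not None
```

For example, a regex written for the uncompiled model, such as `r"transformer\.h\.\d+\.attn\.weight"`, would fail against `"_orig_mod.transformer.h.0.attn.weight"` but matches again once the prefix is stripped.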

Checklist before submitting final PR

  • My PR is minimal and addresses one issue in isolation
  • I have merged the latest version of the target branch into this feature branch
  • I have reviewed my own code w.r.t. correct implementation, missing type hints, proper documentation, etc.
  • I have run a sample config for model training
  • I have checked that all tests run through (python tests/tests.py)
  • I have updated the internal changelog (CHANGELOG_DEV.md)

…d." as a prefix to the original FQN. This causes the regexes for matching parameter names to fail. To fix this, we need to remove the "_orig_mod." prefix from the parameter names before matching them against the regexes. This change needs to be made in both the llama3_like_initialization.py and initialization_routines.py files, wherever we are matching parameter names against regexes.
@le1nux merged commit 8f84b2d into main on Mar 7, 2026
3 checks passed
@le1nux deleted the fix_compile_weight_init_bug branch on Mar 7, 2026 09:25


2 participants