fix(dpa): handle NULL type properly in lammps plugin by link89 · Pull Request #5268 · deepmodeling/deepmd-kit

link89 · 2026-02-25T08:53:21Z

fix: #4749
related to: #4889

Summary by CodeRabbit

Bug Fixes
- Standardized handling of NULL/placeholder atom types across DeepMD/DeepSpin pair styles so ghost/NULL types are represented consistently, reducing misclassification and improving simulation reliability.
Tests
- Added tests covering NULL/placeholder type scenarios in DeepMD pair-style type-map configurations to prevent regressions and validate the updated behavior.

coderabbitai · 2026-02-25T08:57:26Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Map "NULL" atom types to sentinel -1 instead of type_map.size() in DeepMD-related pair/fix code; add unit tests exercising pair_coeff with NULL types.

Changes

Cohort / File(s)	Summary
Pair style / fix updates `source/lmp/fix_dplr.cpp`, `source/lmp/pair_deepmd.cpp`, `source/lmp/pair_deepspin.cpp`	When a coefficient `type_name` equals `"NULL"`, code now records `-1` in type index maps (ghost types) instead of `type_map.size()`. No other control flow or public signatures changed.
Test coverage `source/lmp/tests/test_lammps_dpa_pt.py`, `source/lmp/tests/test_lammps.py`	Added `test_pair_deepmd_type_map_with_null(lammps_type_map)` to exercise DeepMD `pair_coeff` handling of `NULL` coefficients in type-map scenarios.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Suggested labels

C++

Suggested reviewers

wanghan-iapcm
njzjz
iProzd

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 22.22% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'fix(dpa): handle NULL type properly in lammps plugin' clearly and specifically describes the main change: fixing NULL type handling in the LAMMPS DeepMD plugin.
Linked Issues check	✅ Passed	The PR addresses the core objective of issue `#4749` by changing how NULL atom types are represented (using sentinel value -1 instead of type_map.size()), enabling safe handling of NULL coefficients without crashes.
Out of Scope Changes check	✅ Passed	All code changes are directly related to the NULL type handling objective: three source files implement the fix, and two test files provide regression test coverage for the NULL type scenario.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

source/lmp/tests/test_lammps_dpa_pt.py (1)
497-500: Consider adding a basic sanity assertion.

The test is a crash-regression smoke test (no assertions), which is acceptable given the bug being fixed (double-free). However, since run(0) with only H atoms active is a well-defined state, adding even a simple assert lammps_type_map.eval("pe") != 0 (or a pytest.approx against a known value) would make the test more meaningful and protect against silent incorrect behavior.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@source/lmp/tests/test_lammps_dpa_pt.py` around lines 497 - 500, In
test_pair_deepmd_type_map_with_null, after calling lammps_type_map.run(0) add a
simple sanity assertion that checks the computed potential energy is sensible
(for example assert lammps_type_map.eval("pe") != 0 or use pytest.approx against
a known value); place the assertion immediately after lammps_type_map.run(0) to
ensure the test still guards against silent incorrect behavior while keeping it
as a smoke/regression test for the double-free fix.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@source/lmp/tests/test_lammps_dpa_pt.py`:
- Around line 495-502: Add the missing blank line separating top-level test
functions: ensure there are two blank lines between the function definitions
test_pair_deepmd_type_map_with_null and test_pair_deepmd_real by inserting one
more blank line after the end of test_pair_deepmd_type_map_with_null (the
lammps_type_map.run(0) line), then run ruff format/check to confirm PEP8 E302 is
satisfied.

---

Nitpick comments:
In `@source/lmp/tests/test_lammps_dpa_pt.py`:
- Around line 497-500: In test_pair_deepmd_type_map_with_null, after calling
lammps_type_map.run(0) add a simple sanity assertion that checks the computed
potential energy is sensible (for example assert lammps_type_map.eval("pe") != 0
or use pytest.approx against a known value); place the assertion immediately
after lammps_type_map.run(0) to ensure the test still guards against silent
incorrect behavior while keeping it as a smoke/regression test for the
double-free fix.

ℹ️ Review info

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 65eea4b and 05d7bef.

📒 Files selected for processing (4)

source/lmp/fix_dplr.cpp
source/lmp/pair_deepmd.cpp
source/lmp/pair_deepspin.cpp
source/lmp/tests/test_lammps_dpa_pt.py

source/lmp/tests/test_lammps_dpa_pt.py

for more information, see https://pre-commit.ci

Copilot

Pull request overview

Fixes handling of NULL atom types in DeepMD-related LAMMPS plugins by using a sentinel mapping value, and adds a regression test to cover the NULL type-map case.

Changes:

Map NULL types to -1 instead of type_map.size() in multiple LAMMPS plugin components.
Add a LAMMPS Python test to ensure pair_coeff accepts a NULL type entry without crashing.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.

File	Description
source/lmp/tests/test_lammps_dpa_pt.py	Adds a regression test for `NULL` in the DeepMD type map.
source/lmp/pair_deepspin.cpp	Changes `NULL` type index mapping to `-1` sentinel.
source/lmp/pair_deepmd.cpp	Changes `NULL` type index mapping to `-1` sentinel.
source/lmp/fix_dplr.cpp	Changes `NULL` type index mapping to `-1` sentinel.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

source/lmp/pair_deepmd.cpp

source/lmp/pair_deepspin.cpp

source/lmp/fix_dplr.cpp

source/lmp/pair_deepmd.cpp

source/lmp/tests/test_lammps_dpa_pt.py

Signed-off-by: link89 <xuweihong.cn@gmail.com>

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@source/lmp/tests/test_lammps_dpa_pt.py`:
- Around line 498-501: The current test test_pair_deepmd_type_map_with_null only
uses pair_style("deepmd ...") and doesn't exercise the hybrid/scaled + NULL
combination reported in Issue `#4749`; update or add a test that sets pair_style
to a hybrid/scaled form that includes deepmd (e.g., pair_style("hybrid/scaled
deepmd ...")) and then call pair_coeff with the same NULL mapping
(lammps_type_map.pair_coeff("* * H O NULL")) and run(0) to ensure the
hybrid/scaled + NULL path is exercised and does not crash; locate the test
function test_pair_deepmd_type_map_with_null and modify its pair_style
invocation (or create a new test with a distinct name) to use the hybrid/scaled
variant so the exact failing configuration is covered.

ℹ️ Review info

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d61ffbb and dc6b163.

📒 Files selected for processing (1)

source/lmp/tests/test_lammps_dpa_pt.py

source/lmp/tests/test_lammps_dpa_pt.py

Signed-off-by: link89 <xuweihong.cn@gmail.com>

coderabbitai

🧹 Nitpick comments (1)

source/lmp/tests/test_lammps.py (1)

520-524: Smoke test has no assertions — consider verifying forces on non-NULL atoms or zero forces on NULL atoms.

The test correctly exercises the NULL type code path (type-2 atoms in data_type_map_file are mapped to NULL), making it an effective crash-regression guard for issue #4749. However, without any assertion, a silent wrong-result regression (e.g., non-zero forces being computed for NULL atoms) would go undetected.

A lightweight addition would be to assert that atoms of the NULL type receive zero force:

💡 Suggested assertion for NULL-type atoms

 def test_pair_deepmd_type_map_with_null(lammps_type_map) -> None:
     lammps_type_map.pair_style(f"deepmd {pb_file.resolve()}")
     lammps_type_map.pair_coeff("* * H NULL")
     lammps_type_map.run(0)
+    # type_HO maps type 2 → NULL; atoms at 0-based indices 0 and 3 are type 2
+    null_atom_ids = [
+        i for i, t in enumerate(type_HO) if t == 2
+    ]  # 0-based positions of NULL atoms
+    for ii in range(len(lammps_type_map.atoms)):
+        atom = lammps_type_map.atoms[ii]
+        if (atom.id - 1) in null_atom_ids:
+            assert atom.force == pytest.approx([0.0, 0.0, 0.0])

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@source/lmp/tests/test_lammps.py` around lines 520 - 524, The test function
test_pair_deepmd_type_map_with_null currently only runs LAMMPS without
assertions; update it to fetch the computed forces after lammps_type_map.run(0)
and assert that atoms mapped to NULL (type-2 in the data_type_map_file) have
zero force (or that non-NULL atoms have non-zero forces) to prevent silent
regressions — locate the test by function name and add a small check using the
LAMMPS wrapper's force-accessor (or API used elsewhere in tests) after calling
lammps_type_map.run(0) to verify per-atom forces for NULL vs non-NULL types.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@source/lmp/tests/test_lammps.py`:
- Around line 520-524: The test function test_pair_deepmd_type_map_with_null
currently only runs LAMMPS without assertions; update it to fetch the computed
forces after lammps_type_map.run(0) and assert that atoms mapped to NULL (type-2
in the data_type_map_file) have zero force (or that non-NULL atoms have non-zero
forces) to prevent silent regressions — locate the test by function name and add
a small check using the LAMMPS wrapper's force-accessor (or API used elsewhere
in tests) after calling lammps_type_map.run(0) to verify per-atom forces for
NULL vs non-NULL types.

ℹ️ Review info

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dc6b163 and ef8b152.

📒 Files selected for processing (1)

source/lmp/tests/test_lammps.py

codecov · 2026-02-25T14:57:38Z

Codecov Report

❌ Patch coverage is 79.48718% with 16 lines in your changes missing coverage. Please review.
✅ Project coverage is 82.00%. Comparing base (65eea4b) to head (87d7b33).
⚠️ Report is 21 commits behind head on master.

Files with missing lines	Patch %	Lines
source/api_cc/src/DeepPotPT.cc	81.33%	12 Missing and 2 partials ⚠️
source/lmp/fix_dplr.cpp	0.00%	1 Missing ⚠️
source/lmp/pair_deepspin.cpp	0.00%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           master    #5268    +/-   ##
========================================
  Coverage   82.00%   82.00%            
========================================
  Files         750      750            
  Lines       75082    75263   +181     
  Branches     3615     3627    +12     
========================================
+ Hits        61571    61723   +152     
- Misses      12347    12375    +28     
- Partials     1164     1165     +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Signed-off-by: link89 <xuweihong.cn@gmail.com>

@link89

* Initial plan * fix(api_cc): fix segfault in DeepPotPT when using NULL types with hybrid/scaled Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * debug: add logging to DeepPotPT and test to narrow down crash location Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): fix use-after-free in mapping_tensor by persisting mapping_data as member variable Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): fix segfault in DeepPotPT when using NULL types with hybrid/scaled pair style Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix: ELOOP: too many symbolic links encountered * fix(api_cc): apply fwd_map to mapping_data to convert original to real atom indices Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): fix segfault in DeepPotPT when using NULL types with hybrid/scaled pair style Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): skip message passing comm_dict when NULL-type atoms are present When nall_real < nall (NULL-type atoms filtered out), the LAMMPS sendlist is in original-space and may contain atom indices >= nall_real. Passing this sendlist to forward_lower causes OOB access inside the DPA-2 model. Fix: detect has_null_atoms = (nall_real < nall) and skip comm_dict when NULL atoms exist. The ghost atoms are already in the coordinate tensor, so the model can still compute correctly without explicit message passing. Also add sendlist diagnostic logging (first entry per swap, valid range) to expose the index-space mismatch for future debugging. Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): remap sendlist from original-space to real-space when NULL atoms present Instead of skipping comm_dict when NULL-type atoms are present, properly remap the sendlist from original-space to real-space using fwd_map: - Iterate sendlist[s][j] per swap, apply fwd_map to each entry - Filter out NULL-type atoms (fwd_map[orig_idx] == -1) - Build remapped_sendnum_data and remapped_sendlist_data (persistent class members to back the tensors and avoid use-after-free) - Always pass comm_dict to forward_lower when do_message_passing For single-process tests (nswap=0), total_send=0 so the remapping is a no-op. For multi-process with NULL atoms, the sendlist is now correctly in real-space. Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix: add bounds check for orig_idx before fwd_map access in sendlist remapping Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * Delete _codeql_detected_source_root Signed-off-by: link89 <xuweihong.cn@gmail.com> * Delete source/source Signed-off-by: link89 <xuweihong.cn@gmail.com> * fix(api_cc): apply suggested patch - use torch::tensor() and always remap sendlist Key fixes following @link89's patch suggestion: 1. lmp_list.sendlist is int** (jagged array per swap), NOT a flat buffer. The previous code using from_blob(lmp_list.sendlist, {total_send}, ...) was treating the int** pointer-to-pointers as a flat int array - wrong even for the non-NULL case. 2. Always remap sendlist through fwd_map (not just when has_null_atoms), correctly iterating sendlist[iswap] per swap. 3. Use torch::tensor() to copy data into tensors, avoiding any lifetime issues with from_blob. 4. Rename buffers to mp_sendlist_remap_ / mp_sendnum_remap_ (int32_t) matching the suggested naming convention. 5. Add #include <cstdint> to DeepPotPT.h and #include <numeric> to .cc Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * debug: remove all code fixes, keep diagnostic logs only Per @link89's request: revert all code fixes so we can observe the raw crash behavior with the diagnostic logs in place. Reverted: - Removed sendlist remap loop (back to original from_blob(lmp_list.sendlist,...)) - Removed mapping_data persistent member (back to local vector mapping) - Reverted mapping index back to original: mapping[ii] = lmp_list.mapping[fwd_map[ii]] - Removed mp_sendlist_remap_ and mp_sendnum_remap_ class members from header - Removed #include <cstdint> from header (not in original) Kept/added diagnostic logs: - has_null_atoms flag and NULL atom count logging - total_send logging - DIAG: sendlist[s][0] per swap showing first entry vs nall_real - DIAG: first 5 mapping values showing fwd_map[ii] used - nlist copy/shuffle/padding checkpoints - forward_lower call/return checkpoints - bkw mapping force checkpoint Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * log full fwd_map Signed-off-by: link89 <xuweihong.cn@gmail.com> * debug: print all real/virtual atom indices and full comm_dict contents Per @link89's request: add full diagnostic dumps of: 1. All original atom indices with their atype, fwd_map value, and REAL/VIRTUAL label 2. All bkw_map entries showing real->original mapping 3. Full comm_dict contents: send_proc, recv_proc, send_num, recv_num, firstrecv 4. Per-swap sendlist with all entries, annotated with (valid) or (OOB!) relative to nall_real Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * debug: add try-catch around forward_lower and print exception details before re-throw Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * debug: print mapping_tensor shape and values before forward_lower call Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * debug: add logging to Python forward_lower to print atype/nlist/mapping/comm_dict Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): fix compile error - use has_value()/value() for c10::optional mapping_tensor Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): fix Dict<IValue,IValue> default-construction error - use lambda for try-catch around forward_lower Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * feat(api_cc): add make_comm_dict method to remap sendlist from original to real-atom space Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * fix(api_cc): simplify fwd_map lookup in make_comm_dict per review feedback Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * update recvsum for virtual atoms * update comm_dict directly * update comm_dict directly * fix memory lifecycle issue * simplify memory mamangment * simplify memory mamangment * fix memory issue and add more logs * fix segmenet fault * optimize memory usage * refactor(api_cc,pt): remove debug logs and split update_comm_dict for performance Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * remove log in test case * remove _codeql_detected_source_root * refactor duplicated code * refactor duplicated code * fix(tf): fix MessageToString float_format compatibility with protobuf 5.x Co-authored-by: link89 <3314130+link89@users.noreply.github.com> * revert(tf): revert MessageToString fix in convert.py Co-authored-by: link89 <3314130+link89@users.noreply.github.com> --------- Signed-off-by: link89 <xuweihong.cn@gmail.com> Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: link89 <3314130+link89@users.noreply.github.com> Co-authored-by: weihong.xu <xuweihong.cn@gmail.com>

for more information, see https://pre-commit.ci

link89 · 2026-02-27T13:09:43Z

Hi @iProzd @njzjz , this PR is intended to address the issue reported in #4749. I have implemented the fix specifically within DeepPotPT.cpp and would appreciate a priority review and merge. I am leaving the remaining implementations (such as DeepSpinPT.cpp) open for others to address, as I lack the domain knowledge required to properly test those specific modules.

iProzd · 2026-03-04T09:17:25Z

I'll close this PR. See #4889 for details.

link89 added 2 commits February 25, 2026 16:45

fix: set NULL type to -1

e0ef6dc

test(dpa): add test for lammps NULL type

05d7bef

Copilot AI review requested due to automatic review settings February 25, 2026 08:53

dosubot bot added the bug label Feb 25, 2026

github-actions bot added the LAMMPS label Feb 25, 2026

coderabbitai bot reviewed Feb 25, 2026

View reviewed changes

source/lmp/tests/test_lammps_dpa_pt.py Show resolved Hide resolved

[pre-commit.ci] auto fixes from pre-commit.com hooks

1a51465

for more information, see https://pre-commit.ci

Copilot started reviewing on behalf of link89 February 25, 2026 08:58 View session

Copilot AI reviewed Feb 25, 2026

View reviewed changes

link89 added 2 commits February 25, 2026 17:26

test(dpa): test NULL type with hybrid potential

d61ffbb

test: update test for lammps null type

dc6b163

Signed-off-by: link89 <xuweihong.cn@gmail.com>

coderabbitai bot reviewed Feb 25, 2026

View reviewed changes

source/lmp/tests/test_lammps_dpa_pt.py Show resolved Hide resolved

test: add test for lammps null type

ef8b152

Signed-off-by: link89 <xuweihong.cn@gmail.com>

coderabbitai bot reviewed Feb 25, 2026

View reviewed changes

link89 and others added 5 commits February 25, 2026 23:08

test: use lammps hybrid potential to test NULL type

ec3b878

Signed-off-by: link89 <xuweihong.cn@gmail.com>

test: use lammps hybrid potential to test NULL type

c051764

Signed-off-by: link89 <xuweihong.cn@gmail.com>

test: use lammps hybrid potential to test NULL type

7adaf83

Signed-off-by: link89 <xuweihong.cn@gmail.com>

fix: misleading comment

fbece7b

github-actions bot added the C++ label Feb 27, 2026

[pre-commit.ci] auto fixes from pre-commit.com hooks

87d7b33

for more information, see https://pre-commit.ci

njzjz requested review from iProzd, njzjz and wanghan-iapcm February 28, 2026 19:09

iProzd added a commit to iProzd/deepmd-kit that referenced this pull request Mar 3, 2026

update fix, ref to deepmodeling#5268

5875158

iProzd mentioned this pull request Mar 3, 2026

fix(c++): fix NULL type in custom op #4889

Merged

iProzd closed this Mar 4, 2026

Conversation

link89 commented Feb 25, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Suggested labels

Suggested reviewers

❌ Failed checks (1 warning)

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

link89 commented Feb 27, 2026

Uh oh!

iProzd commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

link89 commented Feb 25, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 25, 2026 •

edited

Loading

codecov bot commented Feb 25, 2026 •

edited

Loading