Skip to content

Swap to using 'get_node_ids' that may be sharded or fetch a split, etc#467

Merged
kmontemayor2-sc merged 11 commits intomainfrom
kmonte/get-node-ids
Feb 3, 2026
Merged

Swap to using 'get_node_ids' that may be sharded or fetch a split, etc#467
kmontemayor2-sc merged 11 commits intomainfrom
kmonte/get-node-ids

Conversation

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator

@kmontemayor2-sc kmontemayor2-sc commented Jan 27, 2026

Scope of work done

Per discussion in #438, migrating to some get_node_ids which is more flexible and can fetch a split and optionally shard, etc.

Additionally, breaking out the dataset building utils to their own (tested) file, tests/test_assets/distributed/test_dataset.py and then creating remote_dist_dataset_test.py as unit tests for RDI.

Where is the documentation for this feature?: N/A

Did you add automated tests or write a test plan?

Updated Changelog.md? NO

Ready for code review?: NO

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/unit_test_py

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/integration_test

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 27, 2026

GiGL Automation

@ 17:56:25UTC : 🔄 Python Unit Test started.

@ 19:10:59UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

GiGL Automation

@ 17:56:36UTC : 🔄 Integration Test started.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 27, 2026

GiGL Automation

@ 17:56:55UTC : 🔄 E2E Test started.

@ 19:17:55UTC : ✅ Workflow completed successfully.

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/unit_test_py

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/integration_test

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 28, 2026

GiGL Automation

@ 23:31:16UTC : 🔄 Python Unit Test started.

@ 24:39:48UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

GiGL Automation

@ 23:31:19UTC : 🔄 Integration Test started.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 28, 2026

GiGL Automation

@ 23:31:21UTC : 🔄 E2E Test started.

@ 24:55:47UTC : ✅ Workflow completed successfully.

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/unit_test_py

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/integration_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 28, 2026

GiGL Automation

@ 23:32:48UTC : 🔄 Python Unit Test started.

@ 24:51:43UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

GiGL Automation

@ 23:32:52UTC : 🔄 Integration Test started.

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/unit_test_py

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/integration_test

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

/e2e_test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 29, 2026

GiGL Automation

@ 18:21:41UTC : 🔄 Python Unit Test started.

@ 19:40:00UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 29, 2026

GiGL Automation

@ 18:21:49UTC : 🔄 E2E Test started.

@ 19:43:14UTC : ✅ Workflow completed successfully.

@github-actions
Copy link
Copy Markdown
Contributor

GiGL Automation

@ 18:21:52UTC : 🔄 Integration Test started.

Copy link
Copy Markdown
Collaborator

@mkolodner-sc mkolodner-sc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks Kyle! Left a few small comments, generally LGTM.

In the future, it might be easier to review this if the testing utility changes were moved to a separate PR from the get_node_ids change here.

Comment thread gigl/distributed/graph_store/remote_dist_dataset.py
Comment thread gigl/distributed/graph_store/storage_utils.py
Comment thread gigl/distributed/graph_store/storage_utils.py
Comment thread tests/test_assets/distributed/test_dataset.py
Comment thread tests/test_assets/distributed/test_dataset.py Outdated
Comment thread tests/test_assets/distributed/test_dataset.py Outdated
Comment thread tests/test_assets/distributed/test_dataset.py Outdated
Copy link
Copy Markdown
Collaborator

@svij-sc svij-sc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved from an API change perspective.
Provided some suggestions for further improvement.

Will leave it to @mkolodner-sc to review the details / implementation.

Comment thread gigl/distributed/graph_store/remote_dist_dataset.py
Comment thread gigl/distributed/graph_store/storage_utils.py
Copy link
Copy Markdown
Collaborator

@mkolodner-sc mkolodner-sc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This LGTM, thanks!

@kmontemayor2-sc
Copy link
Copy Markdown
Collaborator Author

🆒

@kmontemayor2-sc kmontemayor2-sc marked this pull request as ready for review February 3, 2026 19:05
@kmontemayor2-sc kmontemayor2-sc added this pull request to the merge queue Feb 3, 2026
@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Feb 3, 2026
@kmontemayor2-sc kmontemayor2-sc added this pull request to the merge queue Feb 3, 2026
Merged via the queue into main with commit 7a028db Feb 3, 2026
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants