Skip to content

[ET Device Support] Define et_copy runtime h2d and d2h copy ops#18729

Open
Gasoonjia wants to merge 7 commits into
gh/gasoonjia/159/basefrom
gh/gasoonjia/159/head
Open

[ET Device Support] Define et_copy runtime h2d and d2h copy ops#18729
Gasoonjia wants to merge 7 commits into
gh/gasoonjia/159/basefrom
gh/gasoonjia/159/head

Conversation

@Gasoonjia
Copy link
Copy Markdown
Contributor

@Gasoonjia Gasoonjia commented Apr 6, 2026

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)

[ghstack-poisoned]
@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 6, 2026
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot Bot commented Apr 6, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18729

Note: Links to docs will display an error until the docs builds have been completed.

❌ 20 New Failures, 2 Cancelled Jobs, 1 Pending, 3 Unrelated Failures, 1 Unclassified Failure

As of commit 8a0b52e with merge base a83e7c4 (image):

NEW FAILURES - The following jobs have failed:

UNCLASSIFIED FAILURE - DrCI could not classify the following job because the workflow did not run on the merge base. The failure may be pre-existing on trunk or introduced by this PR:

  • Check Labels / Check labels (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
    RuntimeError: GraphQL query

CANCELLED JOBS - The following jobs were cancelled. Please retry:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Apr 6, 2026

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

…y ops"

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)

[ghstack-poisoned]
Copy link
Copy Markdown
Contributor

@digantdesai digantdesai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Review automatically exported from Phabricator review in Meta.

…y ops"

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)

[ghstack-poisoned]
[ghstack-poisoned]
Gasoonjia added a commit that referenced this pull request May 18, 2026
Pull Request resolved: #18729

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro
ghstack-source-id: 383670667
@exported-using-ghexport

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
[ghstack-poisoned]
Gasoonjia added a commit that referenced this pull request May 21, 2026
Pull Request resolved: #18729

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro
ghstack-source-id: 383670667
@exported-using-ghexport

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
[ghstack-poisoned]
Gasoonjia added a commit that referenced this pull request May 21, 2026
Pull Request resolved: #18729

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro
ghstack-source-id: 384965089
@exported-using-ghexport

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
[ghstack-poisoned]
Gasoonjia added a commit that referenced this pull request May 22, 2026
Pull Request resolved: #18729

Implement C++ runtime kernels for device copy ops using DeviceAllocator:
- h2d_copy_out: infers device from out tensor, calls
  DeviceAllocator::copy_host_to_device
- d2h_copy_out: infers device from self tensor, calls
  DeviceAllocator::copy_device_to_host
- Registered via EXECUTORCH_LIBRARY macro
ghstack-source-id: 385346808
@exported-using-ghexport

Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/trunk CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants