[ET Device Support] Define et_copy runtime h2d and d2h copy ops#18729
[ET Device Support] Define et_copy runtime h2d and d2h copy ops#18729Gasoonjia wants to merge 7 commits into
Conversation
Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18729
Note: Links to docs will display an error until the docs builds have been completed. ❌ 20 New Failures, 2 Cancelled Jobs, 1 Pending, 3 Unrelated Failures, 1 Unclassified FailureAs of commit 8a0b52e with merge base a83e7c4 ( NEW FAILURES - The following jobs have failed:
UNCLASSIFIED FAILURE - DrCI could not classify the following job because the workflow did not run on the merge base. The failure may be pre-existing on trunk or introduced by this PR:
CANCELLED JOBS - The following jobs were cancelled. Please retry:
FLAKY - The following job failed but was likely due to flakiness present on trunk:
BROKEN TRUNK - The following jobs failed but was present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
…y ops" Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/) [ghstack-poisoned]
digantdesai
left a comment
There was a problem hiding this comment.
Review automatically exported from Phabricator review in Meta.
…y ops" Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/) [ghstack-poisoned]
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 383670667 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 383670667 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 384965089 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 385346808 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Stack from ghstack (oldest at bottom):
Implement C++ runtime kernels for device copy ops using DeviceAllocator:
DeviceAllocator::copy_host_to_device
DeviceAllocator::copy_device_to_host
Differential Revision: D99636776