[ET-VK][ez] Add AOT support for PackedInt8_4C1W dtype by SS-JIA · Pull Request #17389 · pytorch/executorch

SS-JIA · 2026-02-11T20:15:49Z

Stack from ghstack (oldest at bottom):

This adds end-to-end support for the PackedInt8_4C1W memory layout throughout the serialization and AOT pipeline. The 4C1W layout packs 4 channels into a single texel with width-major ordering, which is the natural output layout for convolutions that produce channel-packed results.
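To make "packs 4 channels into a single texel" concrete, here is a minimal sketch that packs four signed 8-bit channel values into one 32-bit word. The helper names `pack_4c` and `unpack_4c` are hypothetical, and the byte order (channel 0 in the lowest byte) is an assumption for illustration; the actual shader layout may differ.

```python
import struct


def pack_4c(channels):
    """Pack four signed int8 channel values into one 32-bit word.

    Hypothetical helper: assumes channel 0 occupies the lowest byte.
    """
    if len(channels) != 4:
        raise ValueError("expected exactly 4 channel values")
    # "<4b" encodes four signed bytes; "<I" reads them back as one
    # little-endian unsigned 32-bit integer.
    raw = struct.pack("<4b", *channels)
    return struct.unpack("<I", raw)[0]


def unpack_4c(texel):
    """Inverse of pack_4c: recover the four signed int8 channel values."""
    return list(struct.unpack("<4b", struct.pack("<I", texel)))
```

For example, `unpack_4c(pack_4c([1, 2, 3, 4]))` round-trips back to the original four values.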

- Adds PACKED_INT8_4C1W = 8 to the FlatBuffers schema and Python schema class
- Adds deserialization mapping in VulkanBackend.cpp
- Updates quantize/dequantize per-tensor op registrations to accept any PackedInt8 layout (not just 4W4C), enabling the layout propagation pass to choose the optimal layout
- Adds new TensorRepSet constants: PACKED_INT8_BUFFER (all quantized layouts), PACKED_INT8_4C1W_BUFFER, and PACKED_INT8_CHANNELS_PACKED_BUFFER (4W4C + 4C1W)
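
A rough way to picture how the new constants relate, and why widening the quantize/dequantize registrations gives layout propagation more freedom, is the sketch below. The constant names come from this description; modelling them as `frozenset`s of strings and the `choose_layout` helper are assumptions for illustration only, not the real TensorRepSet type or the actual propagation pass.

```python
# Layout names are taken from the PR description. Representing them as
# strings in frozensets is an illustrative assumption, not the real API.
PACKED_INT8_4W4C = "PACKED_INT8_4W4C"
PACKED_INT8_4C1W = "PACKED_INT8_4C1W"

PACKED_INT8_4C1W_BUFFER = frozenset({PACKED_INT8_4C1W})
PACKED_INT8_CHANNELS_PACKED_BUFFER = frozenset(
    {PACKED_INT8_4W4C, PACKED_INT8_4C1W}
)
# The PR describes this set as covering all quantized layouts; only the
# two layouts named in the description are listed in this sketch.
PACKED_INT8_BUFFER = frozenset(PACKED_INT8_CHANNELS_PACKED_BUFFER)


def choose_layout(producer_supported, consumer_supported):
    """Hypothetical helper: pick a layout both ops accept, or None."""
    common = producer_supported & consumer_supported
    # Sort for a deterministic choice when several layouts are shared.
    return sorted(common)[0] if common else None
```

With a quantize op that accepts `PACKED_INT8_BUFFER` feeding a consumer restricted to `PACKED_INT8_4C1W_BUFFER`, `choose_layout` returns `"PACKED_INT8_4C1W"`; if the quantize op accepted only 4W4C, the intersection would be empty and a layout conversion would be needed.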

Differential Revision: D93000167


pytorch-bot · 2026-02-11T20:15:54Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17389

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 2 Unrelated Failures

As of commit 32015e1 with merge base 964c565:

NEW FAILURES - The following jobs have failed:

pull / unittest-editable / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_resnet50_model
Test CUDA Builds / test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t 912a0445efe356f613a1ba6f7924f531a1da8ea30ac692bc295e7c66e7d612d8 /exec failed with exit code 1

FLAKY - The following job failed but was likely due to flakiness present on trunk:

periodic / test-models-linux (buck2, mv3, portable, linux.2xlarge, 90) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-samsung-models-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-02-11T20:17:31Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


This was referenced Feb 11, 2026

[ET-VK][qconv][ez] Make q8ta_im2col shader support stride_w != 1 #17386

Open

[ET-VK][qconv] Dynamically select between im2col path and general path #17387

Open

[ET-VK] Migrate to use new q8ta_conv2d ops #17388

Open

meta-cla bot added the CLA Signed label Feb 11, 2026

meta-codesync bot added fb-exported meta-exported labels Feb 11, 2026

SS-JIA mentioned this pull request Feb 11, 2026

Back out "[Diff Train][pytorch/executorch] Apply fixup patch to fbsource" #17399

Merged

SS-JIA wants to merge 3 commits into gh/SS-JIA/420/base from gh/SS-JIA/420/head


1 participant
