Add low precision attention API from torchao to TorchAoConfig (#13285)

Draft

howardzhang-cv wants to merge 1 commit into huggingface:main from howardzhang-cv:feature/fp8_attn_ao

Conversation

@howardzhang-cv (Contributor) commented Mar 19, 2026

What does this PR do?

Adds the low-precision attention API from TorchAO to diffusers by adding an `attn_backend` option to `TorchAoConfig`.
Note: this will require torchao 0.17.0.

Todo:

Results:

Results were measured on FLUX.1-dev at 2048x2048 image size.

| Config                        | Median Time (s) | Speedup |
|-------------------------------|-----------------|---------|
| bf16 baseline                 | 4.39            | 1.00x   |
| fp8_attn                      | 4.14            | 1.06x   |
| bf16 baseline + torch.compile | 3.17            | 1.00x   |
| fp8_attn + torch.compile      | 2.66            | 1.19x   |
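For reference, each speedup figure above is just the ratio of the baseline's median time to the variant's, computed separately for the eager and `torch.compile` settings. A quick sanity check of the arithmetic:

```python
# Speedup = baseline median time / variant median time,
# using the numbers from the table above.
eager_speedup = 4.39 / 4.14      # bf16 baseline vs. fp8_attn (eager)
compiled_speedup = 3.17 / 2.66   # same comparison under torch.compile

print(f"{eager_speedup:.2f}x")    # 1.06x
print(f"{compiled_speedup:.2f}x") # 1.19x
```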
