[WIP] Move XNNPACKQuantizer from PyTorch to ExecuTorch #144940
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/144940
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 5a7bfc9 with merge base b2c89bc.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D68191752
cc @kimishpatel it feels like the reference representation is not really used; should we delete it?
Why is this test being deleted entirely, though? I am guessing it's because the XNNPACK quantizer is moving? I would have to think a bit about removing the reference representation, and we should discuss. Potentially we remove those in favor of providing an API that will use a custom-defined representation.
Yeah, this test is moved to ExecuTorch, I think. Sounds good; a custom-defined representation makes sense as well.
I am thinking of adding a dummy_quantizer in the test dir to run these tests, especially the QAT ones.
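A minimal sketch of what such a dummy test quantizer could look like. In the real code it would subclass `torch.ao.quantization.quantizer.Quantizer` (whose interface is `annotate`/`validate`); the stand-in below models that interface without importing torch, and the node handling is purely illustrative.

```python
# Hypothetical stand-in for a test-only quantizer; the class name and
# the fake "graph_nodes" attribute are assumptions for illustration.
class DummyQuantizer:
    def annotate(self, model):
        # A real implementation would attach QuantizationAnnotation
        # objects to nodes of the exported GraphModule here.
        for node in getattr(model, "graph_nodes", []):
            node["quantization_annotation"] = {"annotated": True}
        return model

    def validate(self, model):
        # The base class requires validate(); a no-op suffices for tests.
        pass
```

Such a stub would let the PT2E prepare/convert tests run in PyTorch without depending on the quantizer that moved to ExecuTorch.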
@@ -31,7 +31,7 @@ def generate_numeric_debug_handle(ep: ExportedProgram) -> None:
     generate_numeric_debug_handle(ep)

     m = ep.module()
-    quantizer = XNNPACKQuantizer()
+    quantizer = XNNPACKQuantizer()  # In the ExecuTorch repo.
Please add an import for this.
What do you mean? This is an example; do you mean to put the import line in addition to the comment in the docstring?
Yeah, just to make it clearer since it's in ExecuTorch now.
@@ -64,6 +58,73 @@
     "get_x86_inductor_linear_dynamic_fp16_config",
 ]

+# In the absence of better name, just winging it with QuantizationConfig
+@dataclass(eq=True, frozen=True)
+class QuantizationConfig:
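For context, a hedged sketch of what this convenience dataclass plausibly bundles. The field names below are assumptions based on what a quantizer config typically groups (per-tensor `QuantizationSpec` objects in the real code), not a copy of the actual class.

```python
# Illustrative sketch only; field names are assumptions, and Any stands
# in for torch.ao.quantization.quantizer.QuantizationSpec.
from dataclasses import dataclass
from typing import Any, Optional

@dataclass(eq=True, frozen=True)
class QuantizationConfig:
    input_activation: Optional[Any] = None   # QuantizationSpec in the real code
    output_activation: Optional[Any] = None
    weight: Optional[Any] = None
    bias: Optional[Any] = None
    is_qat: bool = False
```

`eq=True, frozen=True` makes instances hashable and immutable, so configs can be compared and safely shared across operator annotations.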
We can probably put these in quantizer/utils.py since multiple quantizers are using them, but it should be clear that there is no requirement for quantizers to use them.
I remember we discussed this a bit before, @kimishpatel, and you were pushing for having these as utils. I feel it's fine to have them in utils as long as we are clear that they are more of a convenience and not required.
I can move them, but IIRC no one else is using them, so I kept them here.
Simplification!!
Skipping the PR sanity check is obviously OK for this one!
docs/source/conf.py
Outdated
@@ -359,7 +359,6 @@
     "prepare_qat_pt2e",
     # torch.ao.quantization.quantizer.embedding_quantizer
     "get_embedding_operators_config",
-    # torch.ao.quantization.quantizer.xnnpack_quantizer_utils
Everything below this comment until the next one comes from this namespace, so you can delete all those lines as well.
 __all__ = [
     "get_embedding_operators_config",
     "EmbeddingQuantizer",
 ]

+# In the absence of better name, just winging it with QuantizationConfig
+@dataclass(eq=True, frozen=True)
These need to be added to the __all__ block above and be properly documented on the doc website if you want to move them here as public API.
Force-push: 76818cb to a175859
Summary:
Pull Request resolved: pytorch#144940
X-link: pytorch/ao#1572

This migrates XNNPACKQuantizer from PyTorch to ExecuTorch.

Rationale: The main motivation is to avoid a PyTorch pin update in OSS after every XNNPACKQuantizer change, which can be rather frequent.

Other impact and considerations:
- The PT2E flow (which lives in PyTorch) relies heavily on XNNPACKQuantizer as an "example" quantizer implementation and, more importantly, for tests. For now, after talking with jerryzh168, this diff moves most of the tests to ExecuTorch/backends/xnnpack/test/quantizer.
- Other OSS repositories using XNNPACKQuantizer from PyTorch now have to take an additional dependency on ExecuTorch.

Differential Revision: D68191752
Please update the PR summary as well to describe the migration plan.
Force-push: a175859 to 6955b5f
Force-push: efd0614 to 13be4ac
Force-push: 13be4ac to a401071
Force-push: a401071 to 4698bda
Force-push: 4698bda to 5a7bfc9
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged)
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Summary:
This replicates XNNPACKQuantizer from PyTorch to ExecuTorch.

Rationale: The main motivation is to avoid a PyTorch pin update in OSS after every XNNPACKQuantizer change, which can be rather frequent.

Other impact and considerations:
- The PT2E flow (which lives in PyTorch) relies heavily on XNNPACKQuantizer as an "example" quantizer implementation and, more importantly, for tests. For now, we will keep torch.ao.quantization.xnnpack_quantizer as is, but mark it as not covered by BC guarantees and deprecated, to discourage new dependencies on it.
- Other OSS repositories using XNNPACKQuantizer from PyTorch now have to take an additional dependency on ExecuTorch.

Differential Revision: D68191752
cc @albanD @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames