
[Intel GPU][pt2e]: Collapse 3D input to 2D for matmul in qlinear_pointwise_binary fusion #148423


Closed · ZhiweiYan-96 wants to merge 3 commits

Conversation

ZhiweiYan-96 (Collaborator) commented Mar 4, 2025

Motivation

During the qlinear_pointwise_binary lowering pass, dimension collapsing only occurs when the post-op is add; it is the responsibility of the C++ kernel to handle dimensions for the post-op sum.

Details

This PR explicitly reshapes the input from 3D to 2D in the qlinear_pointwise_binary op. In addition, we refactor the qlinear_pointwise_binary.tensor implementation to call qlinear_pointwise_binary, removing duplicated code.
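
For illustration, here is a minimal sketch of the collapse-and-restore pattern (a plain linear + add stands in for the fused quantized kernel; the function name and shapes are hypothetical, not the actual op implementation):

```python
import torch

def linear_binary_collapsed(x, weight, bias, other):
    # Collapse a 3D activation [B, M, K] to 2D [B*M, K] so the fused op
    # runs as a plain 2D matmul, then restore the leading dims afterwards.
    orig_shape = x.shape
    if x.dim() == 3:
        x = x.reshape(-1, orig_shape[-1])           # [B*M, K]
        other = other.reshape(-1, other.shape[-1])  # match the 2D output
    out = torch.nn.functional.linear(x, weight, bias) + other
    if len(orig_shape) == 3:
        out = out.reshape(orig_shape[0], orig_shape[1], -1)  # back to [B, M, N]
    return out

# A 3D input goes through the same 2D matmul path and comes back 3D.
x, other = torch.randn(2, 4, 10), torch.randn(2, 4, 10)
out = linear_binary_collapsed(x, torch.randn(10, 10), torch.randn(10), other)
assert out.shape == (2, 4, 10)
```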

UT testing

`python test/inductor/test_mkldnn_pattern_matcher.py -k test_qlinear_add_xpu`

Stack from ghstack (oldest at bottom):

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @voznesenskym @penguinwu @EikanWang @Guobing-Chen @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

[ghstack-poisoned]
pytorch-bot bot commented Mar 4, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148423

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 9753ff4 with merge base b3bb73e:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Mar 4, 2025
ZhiweiYan-96 added a commit that referenced this pull request Mar 4, 2025
…wise_binary fusion

ghstack-source-id: 58d1fe1
Pull Request resolved: #148423
@ZhiweiYan-96 ZhiweiYan-96 marked this pull request as draft March 4, 2025 05:58
@ZhiweiYan-96 ZhiweiYan-96 added ciflow/xpu Run XPU CI tasks topic: not user facing topic category ciflow/inductor ciflow/trunk Trigger trunk jobs on your pull request labels Mar 4, 2025
```python
def __init__(self):
    super(Model, self).__init__()
    self.linear = torch.nn.Linear(10, 10)
    self.relu = torch.nn.ReLU()
```

Suggested change: remove the line `self.relu = torch.nn.ReLU()`.

ZhiweiYan-96 (Collaborator, Author)

Thanks for the suggestion; since the UT has been removed, we can resolve this issue.

[ghstack-poisoned]
ZhiweiYan-96 added a commit that referenced this pull request Mar 4, 2025
…wise_binary fusion

ghstack-source-id: 930d36e
Pull Request resolved: #148423
Collaborator

@ZhiweiYan-96, why do we need to add a dedicated test file? I suppose it should reuse the existing test files, right?

ZhiweiYan-96 (Collaborator, Author)

Thanks for the suggestion; this file is not necessary. I have removed it and added 3D cases to test_mkldnn_pattern_matcher.py.
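
For illustration, a 3D case of the kind added there might look like this minimal module (hypothetical shapes and names, not the actual test code):

```python
import torch

class LinearAdd(torch.nn.Module):
    # Minimal pattern: linear on a 3D activation followed by a binary add,
    # which exercises the qlinear_pointwise_binary fusion on 3D input.
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(10, 10)

    def forward(self, x, other):
        return self.linear(x) + other  # x: [B, M, 10] stays 3D

mod = LinearAdd().eval()
x, other = torch.randn(2, 4, 10), torch.randn(2, 4, 10)
assert mod(x, other).shape == (2, 4, 10)
```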

@ZhiweiYan-96 ZhiweiYan-96 changed the title [PT2E Intel GPU]: Collapse 3D input to 2D for matmul in qlinear_pointwise_binary fusion [Intel GPU][pt2e]: Collapse 3D input to 2D for matmul in qlinear_pointwise_binary fusion Mar 5, 2025
[ghstack-poisoned]
@EikanWang EikanWang marked this pull request as ready for review March 5, 2025 08:17
@EikanWang EikanWang moved this to Review Required in PyTorch Intel Mar 5, 2025
@EikanWang EikanWang added the keep-going Don't stop on first failure, keep running tests until the end label Mar 5, 2025
@EikanWang
Collaborator

@pytorchbot merge


@pytorchmergebot
Collaborator

Merge failed

Reason: Approvers from one of the following sets are needed:

  • superuser (pytorch/metamates)
  • Core Reviewers (mruberry, lezcano, Skylion007, ngimel, peterbell10, ...)
  • Core Maintainers (soumith, gchanan, ezyang, dzhulgakov, malfet, ...)
Details for Dev Infra team: raised by workflow job.

Failing merge rule: Core Maintainers

@EikanWang EikanWang requested review from desertfire and jansel March 5, 2025 13:50
@ZhiweiYan-96
Collaborator Author
ZhiweiYan-96 commented Mar 6, 2025

Hi @jansel @desertfire,
Would you mind reviewing the code changes on the inductor side in this PR and #148522? Your suggestions are greatly appreciated. Please note that we've already conducted an internal review of the changes in the XPU backend (aten/src/ATen/native/mkldnn/xpu/), so feel free to focus on the new additions, or review the entire change as you prefer.
Thanks again for your time 😄

@ZhiweiYan-96
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@github-project-automation github-project-automation bot moved this from Review Required to Done in PyTorch Intel Mar 7, 2025
pytorchmergebot pushed a commit that referenced this pull request Mar 7, 2025
# Motivation & Details
This PR fixes a bug that previously blocked quantized grouped convolution. The bug was caused by the fact that grouped convolution requires setting the weight scale mask on both the group dimension and the output-channel dimension. This PR fixes the wrong mask in the integration and adds grouped conv cases to the UT.

# UT
`python test/inductor/test_mkldnn_pattern_matcher.py -k test_qconv2d_xpu`

# Runtime exemplification
```
onednn_verbose,v1,primitive,exec,gpu:0,convolution,jit:ir,forward_training,src:s8::blocked:acdb::f0 wei:s8::blocked:abcde::f0 bia:f32::blocked:a::f0 dst:f32::blocked:acdb::f0,attr-scratchpad:user attr-scales:src0:0:f32+dst:0:f32+wei:3:f32 attr-zero-points:src0:0:s32,alg:convolution_direct,g4mb1_ic128oc128_ih4oh2kh3sh1dh0ph0_iw4ow2kw3sw1dw0pw0,0.0529785
```
The verbose output shows that we successfully run the quantized convolution, where the weight is in `abcde` format (grouped conv).
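
Illustratively, the corrected mask can be read off that verbose line (a sketch under assumed conventions, not the actual integration code; oneDNN scale masks set bit i for each weight dimension i that carries its own scale, and the grouped weight layout here is [groups, oc_per_group, ic, kh, kw]):

```python
# Sketch: per-channel weight-scale mask for a grouped convolution.
# With weights laid out as [groups, oc_per_group, ic, kh, kw] (`abcde`),
# scales vary over both dim 0 (groups) and dim 1 (output channels),
# so both bits must be set in the mask.
GROUP_DIM, OC_DIM = 0, 1          # illustrative names, not oneDNN API
wei_scale_mask = (1 << GROUP_DIM) | (1 << OC_DIM)
assert wei_scale_mask == 3        # matches `wei:3:f32` in the verbose log
```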

Pull Request resolved: #148522
Approved by: https://github.com/EikanWang, https://github.com/liangan1, https://github.com/jansel
ghstack dependencies: #148423
@github-actions github-actions bot deleted the gh/ZhiweiYan-96/51/head branch April 11, 2025 02:30