[inductor] add conv_transpose2d unary fusion for cpu in inference mode #90265

chunyuan-w · 2022-12-06T07:34:02Z

Stack from ghstack (oldest at bottom):

This PR adds an FX transformation that fuses ConvTranspose2d with element-wise (eltwise) ops in TorchInductor for CPU in inference mode, following the implementation in #87063.

The fusion OP is implemented in #90264 and will be treated as an extern kernel call in torchinductor.

The fusion of ConvTranspose2d with the below OPs is supported:

- relu
- sigmoid
- tanh
- hardswish
- leaky_relu
- hardtanh
- gelu
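
To illustrate the kind of graph rewrite described above, here is a minimal sketch of an FX pass that looks for a `ConvTranspose2d` module feeding one of the listed unary modules and replaces the pair with a single call. It is not the code from this PR: `FusedConvTransposeUnary` and `fuse_conv_transpose_unary` are hypothetical names, the "fused" module simply runs the two original modules back to back in Python rather than calling the extern kernel from #90264, and the sketch does not check that the model is in inference mode.

```python
import torch
import torch.fx as fx
import torch.nn as nn

# Module forms of the unary ops listed above.
UNARY_MODULES = (
    nn.ReLU, nn.Sigmoid, nn.Tanh, nn.Hardswish,
    nn.LeakyReLU, nn.Hardtanh, nn.GELU,
)


class FusedConvTransposeUnary(nn.Module):
    """Hypothetical stand-in for a fused op: one call that runs both modules."""

    def __init__(self, conv_transpose: nn.ConvTranspose2d, unary: nn.Module):
        super().__init__()
        self.conv_transpose = conv_transpose
        self.unary = unary

    def forward(self, x):
        return self.unary(self.conv_transpose(x))


def fuse_conv_transpose_unary(gm: fx.GraphModule) -> fx.GraphModule:
    """Replace ConvTranspose2d -> unary module pairs with a single call_module node."""
    modules = dict(gm.named_modules())
    for node in list(gm.graph.nodes):
        # The candidate node must be a call to one of the supported unary modules.
        if node.op != "call_module":
            continue
        if not isinstance(modules.get(node.target), UNARY_MODULES):
            continue
        if len(node.args) != 1 or not isinstance(node.args[0], fx.Node):
            continue
        prev = node.args[0]
        # Its only input must be a ConvTranspose2d call with no other consumers.
        if prev.op != "call_module":
            continue
        if not isinstance(modules.get(prev.target), nn.ConvTranspose2d):
            continue
        if len(prev.users) != 1:
            continue

        fused = FusedConvTransposeUnary(modules[prev.target], modules[node.target])
        fused_name = f"{prev.name}_{node.name}_fused"
        gm.add_submodule(fused_name, fused)
        modules[fused_name] = fused

        # Insert the fused call, reroute consumers, then drop the old nodes.
        with gm.graph.inserting_after(node):
            new_node = gm.graph.call_module(fused_name, prev.args, prev.kwargs)
        node.replace_all_uses_with(new_node)
        gm.graph.erase_node(node)
        gm.graph.erase_node(prev)

    gm.graph.lint()
    gm.recompile()
    return gm


model = nn.Sequential(
    nn.ConvTranspose2d(3, 8, kernel_size=3, padding=1),
    nn.ReLU(),
).eval()
fused_gm = fuse_conv_transpose_unary(fx.symbolic_trace(model))
print(fused_gm.graph)
```

The single-consumer check matters: if another node also reads the un-activated ConvTranspose2d output, folding the activation into the producer would change what that node sees.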

cc @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @desertfire

pytorch-bot · 2022-12-06T07:34:05Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90265

❌ 2 Failures

As of commit 88e7c73:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

XiaobingSuper · 2022-12-08T09:39:11Z

torch/_inductor/ir.py

@@ -3411,6 +3411,70 @@ def _prepare_convolution_fusion_create(
    return inputs, constant_args, kernel_layout, req_stride_order


+def _prepare_convolution_transpose_fusion_create(
+    cls,


Could we combine this code with convolution?

Updated the code to reuse the _prepare_convolution_fusion_create function for ConvTranspose.

jgong5 · 2022-12-09T03:20:14Z

test/inductor/test_torchinductor.py

+            [1, 3],
+            [1, 2],
+            [1, 4],
+            [0],


Cover non-zero padding as well?

Changed padding from [0] to [0, 1] to cover the non-zero padding case.
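
As background on why non-zero padding is a distinct case worth testing: for a transposed convolution, padding shrinks the output rather than growing it. The sketch below applies the output-size formula from the `torch.nn.ConvTranspose2d` documentation along one spatial dimension. The helper name `conv_transpose2d_out_size` and the parameter grid are illustrative assumptions, not the values or code used in the test file.

```python
from itertools import product


def conv_transpose2d_out_size(in_size, kernel_size, stride=1, padding=0,
                              output_padding=0, dilation=1):
    """Output length along one spatial dimension of a transposed convolution."""
    return ((in_size - 1) * stride - 2 * padding
            + dilation * (kernel_size - 1) + output_padding + 1)


# Hypothetical grid: each extra unit of padding removes two output elements.
for stride, padding in product([1, 2], [0, 1]):
    size = conv_transpose2d_out_size(8, kernel_size=3, stride=stride, padding=padding)
    print(f"stride={stride} padding={padding} -> output size {size}")
```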

chunyuan-w · 2022-12-15T14:20:16Z

@pytorchbot merge

pytorchmergebot · 2022-12-15T14:21:58Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

ezyang · 2022-12-17T05:06:20Z

@pytorchbot revert -c ghfirst -m "earlier pr on stack got yanked, this one needs to go too"

pytorchmergebot · 2022-12-17T05:07:55Z

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot · 2022-12-17T05:08:03Z

@chunyuan-w your PR has been successfully reverted.

…ence mode (#90265)" This reverts commit d6fe983. Reverted #90265 on behalf of https://github.com/ezyang due to earlier pr on stack got yanked, this one needs to go too

chunyuan-w · 2023-01-11T07:09:20Z

Re-opened in #91954

…in inference mode (#91954) Re-land #90265. Depend on internal ideep upgrade. [Update]: internal ideep upgrade issue is resolved in #92239. Pull Request resolved: #91954 Approved by: https://github.com/jgong5, https://github.com/desertfire

github-actions bot added ciflow/inductor module: inductor labels Dec 6, 2022

This was referenced Dec 6, 2022

add conv_transpose2d pointwise(unary) fusion kernel #90264

Closed

[inductor] weight prepack for _convolution_transpose_pointwise #90266

Closed

[inductor] weight prepack for single conv_transpose2d #90267

Closed

chunyuan-w marked this pull request as draft December 6, 2022 07:35

chunyuan-w added the topic: not user facing topic category label Dec 6, 2022

pytorchbot added the open source label Dec 6, 2022

chunyuan-w added a commit to chunyuan-w/pytorch that referenced this pull request Dec 6, 2022

[inductor] add conv_transpose2d unary fusion for cpu in inference mode

96949bc

ghstack-source-id: 5cc3977 Pull Request resolved: pytorch#90265

chunyuan-w added 2 commits December 6, 2022 14:51

[inductor] add conv_transpose2d unary fusion for cpu in inference mode

fc49b10

[ghstack-poisoned]

chunyuan-w requested review from jgong5, EikanWang and XiaobingSuper December 7, 2022 03:21

XiaobingSuper reviewed Dec 8, 2022

jgong5 reviewed Dec 9, 2022

chunyuan-w marked this pull request as ready for review December 9, 2022 08:01

jgong5 approved these changes Dec 9, 2022

chunyuan-w added a commit to chunyuan-w/pytorch that referenced this pull request Dec 13, 2022

[inductor] add conv_transpose2d unary fusion for cpu in inference mode

7273cf4

ghstack-source-id: c019aa2 Pull Request resolved: pytorch#90265

chunyuan-w requested review from jansel, desertfire and Chillee December 13, 2022 10:02

jansel approved these changes Dec 13, 2022

chunyuan-w added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 15, 2022

chunyuan-w added 2 commits December 15, 2022 12:33

pytorchmergebot added the Merged label Dec 15, 2022

pytorchmergebot closed this in d6fe983 Dec 15, 2022

chunyuan-w added 2 commits December 15, 2022 14:56

pytorchmergebot added the Reverted label Dec 17, 2022

chunyuan-w reopened this Dec 19, 2022

chunyuan-w added 2 commits December 19, 2022 08:53

chunyuan-w added a commit to chunyuan-w/pytorch that referenced this pull request Jan 10, 2023

[inductor] add conv_transpose2d unary fusion for cpu in inference mode

2145d26

ghstack-source-id: a8958be Pull Request resolved: pytorch#90265

chunyuan-w mentioned this pull request Jan 10, 2023

[Re-land 90265] [inductor] add conv_transpose2d unary fusion for cpu in inference mode #91954

Closed

chunyuan-w closed this Jan 11, 2023

EikanWang mentioned this pull request Jan 13, 2023

[PT2.0 Feature Proposal] TorchInductor CPU FP32 Inference Optimization #92135

Closed

facebook-github-bot deleted the gh/chunyuan-w/16/head branch June 8, 2023 15:56
