Add remaining method and tests for dtype propagation by eellison · Pull Request #140057 · pytorch/pytorch · GitHub

Add remaining method and tests for dtype propagation #140057


Closed
wants to merge 14 commits

Conversation

@eellison (Contributor) commented Nov 7, 2024

Stack from ghstack (oldest at bottom):

Adds dtype rules for the remaining unimplemented ops, as well as an assertion failure if someone adds a new op without a dtype rule.

We test all unique pointwise operators that are registered as lowerings and have an OpInfo. There will be some follow-ups to make this work well with `codegen_upcast_to_fp32` as both True and False.
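A minimal sketch of that pattern, with hypothetical names (the real rules live in inductor's dtype propagation handler):

```python
import torch
from typing import Callable, Dict

# Hypothetical registry mapping op names to dtype-inference rules.
_DTYPE_RULES: Dict[str, Callable[..., torch.dtype]] = {}

def register_dtype_rule(op_name: str):
    def decorator(fn):
        _DTYPE_RULES[op_name] = fn
        return fn
    return decorator

@register_dtype_rule("add")
def add_dtype(x: torch.dtype, y: torch.dtype) -> torch.dtype:
    # Pointwise ops follow standard type promotion.
    return torch.promote_types(x, y)

def propagate_dtype(op_name: str, *input_dtypes: torch.dtype) -> torch.dtype:
    # Fail loudly when an op has no rule, mirroring the assertion
    # this PR adds for newly registered ops.
    assert op_name in _DTYPE_RULES, f"missing dtype rule for {op_name}"
    return _DTYPE_RULES[op_name](*input_dtypes)

assert propagate_dtype("add", torch.bfloat16, torch.float32) == torch.float32
```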

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov

pytorch-bot (bot) commented Nov 7, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/140057

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fd40d4e with merge base 43afaa4:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@blaine-rister (Contributor)

Whoops, accidentally closed this
eellison added a commit that referenced this pull request Nov 18, 2024
ghstack-source-id: 6b15789
Pull Request resolved: #140057
eellison added a commit that referenced this pull request Nov 19, 2024
@eellison eellison added the topic: not user facing topic category label Nov 19, 2024

    @staticmethod
    def round_decimal(x: DTypeArg, y: DTypeArg) -> torch.dtype:
        # TODO - dont see it anywhere..
Contributor

Does this comment mean the op isn't actually used?

Contributor Author

Yeah, I don't see a lowering for it, and I don't know how it got here...
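For reference, one way to check whether an op actually has an inductor lowering registered. `lowerings` is an internal registry, so treat this as a sketch that may differ across versions:

```python
import torch
from torch._inductor.lowering import lowerings  # internal inductor registry

# Membership tells you whether a lowering was registered for this overload.
print(torch.ops.aten.round.decimals in lowerings)
```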

eellison added a commit that referenced this pull request Nov 19, 2024
@blaine-rister (Contributor) left a comment

LGTM! Left a few nit comments about clarifying some of the trickier aspects.

@blaine-rister (Contributor)

nit: Should the PR title mention dtype propagation?

@arui-meta (Contributor)

LGTM! Thanks for working on this!

@eellison eellison added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 27, 2024
@eellison (Contributor Author)

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

pytorchmergebot pushed a commit that referenced this pull request Nov 28, 2024
- Add upcast_compute_type on creation of new tensors (loads, constants).
- Fixes index_expr: right now we are somewhat inconsistent in dtype and don't always respect the dtype specified. It would be nice to fix, but not in this PR.
- Bug fix in view dtype: we were always upcasting back to fp32 when the input was bf16/fp16, but we should only do that if the output is also bf16/fp16 (see the sketch after this commit message).
- For masked, avoid calling dtype propagation and just use the output dtype.

Turns on the runtime dtype verification for OpInfo tests. The separate test file is still useful because we can use it to test with codegen_upcast_to_fp32 turned off.

Follow-ups:

- We could consider requiring fewer explicit upcast_compute_type calls and doing the upcast automatically. That would potentially make things easier, but would be less flexible in the future. Maybe I should have done it in this PR.
- Be more consistent in our index-expr dtype printing.

Pull Request resolved: #141495
Approved by: https://github.com/blaine-rister, https://github.com/arui-meta, https://github.com/ezyang
ghstack dependencies: #139945, #140057
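A rough sketch of the view-dtype rule described above, with hypothetical helper names (inductor's actual codegen differs):

```python
import torch

LOW_PRECISION_FLOATS = (torch.bfloat16, torch.float16)

def view_compute_dtype(input_dtype: torch.dtype,
                       output_dtype: torch.dtype) -> torch.dtype:
    # Before the fix: any bf16/fp16 input was unconditionally upcast
    # back to fp32. After the fix: only keep computing in fp32 when the
    # *output* is also a low-precision float; a bitcast to, say, int16
    # must preserve the exact bits instead.
    if output_dtype in LOW_PRECISION_FLOATS:
        return torch.float32
    return output_dtype
```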
pytorchmergebot pushed a commit that referenced this pull request Dec 3, 2024
Should fix the compile-time regression: it was doing fairly expensive metaprogramming in __init__ and being instantiated multiple times.

Pull Request resolved: #141882
Approved by: https://github.com/ezyang
ghstack dependencies: #139945, #140057, #141495
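A generic illustration of that kind of fix, not the actual inductor code: hoist the expensive work out of `__init__` so it runs once per process rather than once per instance.

```python
import functools

def _all_op_names():
    # Stand-in for enumerating every registered op.
    return ["add", "mul", "exp"]

def _make_rule(name):
    # Stand-in for the expensive per-op rule construction.
    return lambda *dtypes: dtypes[0]

@functools.lru_cache(maxsize=None)
def _build_op_table():
    # Cached: the metaprogramming now runs once per process.
    return {name: _make_rule(name) for name in _all_op_names()}

class DtypePropagationHandler:
    def __init__(self):
        # Before the fix, a table like this was rebuilt on every
        # instantiation, showing up as a compile-time regression.
        self.op_table = _build_op_table()
```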
pytorchmergebot pushed a commit that referenced this pull request Dec 4, 2024
- Set the dtype of "acc" appropriately so that epilogue fusion will have args with a dtype.
- Update dtype propagation to use `type_to_dtype` instead of instantiating a tensor.
- Throw if we have a string arg where we should have a proper CSEVariable, unless we're doing the Modification Subgraph thing, which is not yet implemented; everything else is appropriately typed (cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov @drisspg).

Pull Request resolved: #141991
Approved by: https://github.com/drisspg
ghstack dependencies: #139945, #140057, #141495, #141882
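For context, a sketch of what a `type_to_dtype`-style helper does: map a Python scalar type straight to a torch dtype instead of building a tensor just to read its `.dtype` (names and defaults here are assumptions):

```python
import torch

def type_to_dtype(typ: type) -> torch.dtype:
    # Direct mapping; avoids the slower torch.tensor(...).dtype trick.
    if typ is bool:
        return torch.bool
    if typ is int:
        return torch.int64
    if typ is float:
        return torch.get_default_dtype()
    if typ is complex:
        # Assumption: the complex default tracks the float default.
        if torch.get_default_dtype() == torch.float32:
            return torch.complex64
        return torch.complex128
    raise ValueError(f"unsupported scalar type: {typ}")

assert type_to_dtype(int) == torch.int64
```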
pobin6 pushed commits to pobin6/pytorch that referenced this pull request Dec 5, 2024, mirroring pytorch#140057 (approved by arui-meta, blaine-rister, and ezyang), pytorch#141495, pytorch#141882, and pytorch#141991.

AmdSampsa pushed a commit to AmdSampsa/pytorch that referenced this pull request Dec 9, 2024, mirroring pytorch#141991.
@github-actions github-actions bot deleted the gh/eellison/726/head branch December 28, 2024 02:03