[invoke_subgraph] Run missing graph passes recursively #152675
Conversation
Dr. CI: 🔗 Helpful links: 🧪 see artifacts and rendered test results at hud.pytorch.org/pr/152675. Note: links to docs will display an error until the docs builds have completed.
❗ 1 active SEV. If your PR is affected, please view it on the HUD.
✅ You can merge normally (5 unrelated failures). As of commit 239dd4e with merge base fdadda2:
BROKEN TRUNK: the following jobs failed but were already failing on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
UNSTABLE: the following job is marked as unstable, possibly due to flakiness on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```diff
@@ -1207,6 +1207,15 @@ def view_to_reshape(gm):
     ):
         nd.target = torch.ops.aten.reshape.default

+    subgraph_names: OrderedSet[str] = OrderedSet()
```
More of a general question: we're probably going to have to audit the other one-off passes that Inductor runs and ensure that they recurse too, right? I wonder if we can add some more invariants, or refactor Inductor's one-off passes so that they get this recursive behavior more automatically for future graph-pass writers (maybe worth filing a BE issue?).
+1. Can we refactor GraphTransformObserver so it applies the passes to subgraphs automatically?
Yes, that is my main concern. The only way I verified this today was by taking 4 models and ensuring that the full-model invoke_subgraph run produces the same kernels as the baseline. This is definitely not exhaustive.
@eellison This one is run outside of post_grad, so we don't use GraphTransformObserver here.
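For readers following the thread, here is a minimal sketch of the kind of refactor being proposed: a wrapper that takes any FX graph pass and reapplies it to every subgraph referenced through a get_attr node. The helper name and structure are hypothetical; this is not current GraphTransformObserver behavior.

```python
# Hypothetical helper, sketched for discussion; it does not exist in
# Inductor or GraphTransformObserver today.
from typing import Callable

import torch.fx


def recurse_into_subgraphs(
    graph_pass: Callable[[torch.fx.GraphModule], None],
) -> Callable[[torch.fx.GraphModule], None]:
    """Wrap an FX graph pass so it also runs on get_attr subgraphs."""

    def wrapper(gm: torch.fx.GraphModule) -> None:
        graph_pass(gm)
        # HOPs such as invoke_subgraph reference their subgraphs through
        # get_attr nodes on the parent graph.
        subgraph_names = {n.target for n in gm.graph.find_nodes(op="get_attr")}
        for name, child in gm.named_children():
            if name in subgraph_names and isinstance(child, torch.fx.GraphModule):
                wrapper(child)  # recurse, since subgraphs can nest

    return wrapper
```

With a wrapper like this, a pass author would write `recurse_into_subgraphs(view_to_reshape)(gm)` once instead of hand-rolling the recursion in each pass.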
```python
        example_inputs.append(operand.meta["val"])
    return FakeTensorProp(
        getattr(self.module, n.args[0].target), mode=self._mode
    ).propagate(*example_inputs)
```
This reminds me: Inductor also has a way of doing incremental fake tensor updates through FakeTensorUpdater.incremental_update. The incremental updater might not know to properly perform incremental updates on subgraphs today, which could lead to silent correctness issues: https://github.com/pytorch/pytorch/blob/main/torch/_inductor/fx_utils.py#L153
I checked that one. Anything that runs during the post_grad passes will recurse into subgraphs. view_to_reshape runs outside of post_grad, which is why we need this PR.
FakeTensorUpdater doesn't work on HOPs; we should fix that at some point.
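For context on the excerpt above: FakeTensorProp runs a graph under a FakeTensorMode and records each node's fake output in `node.meta["val"]`, which is exactly the metadata the subgraph re-propagation reads. A minimal, self-contained usage sketch (the toy function is mine, not from the PR):

```python
import torch
from torch._subclasses.fake_tensor import FakeTensorMode
from torch.fx.passes.fake_tensor_prop import FakeTensorProp


def toy(x: torch.Tensor) -> torch.Tensor:
    return torch.relu(x @ x)


gm = torch.fx.symbolic_trace(toy)

# propagate() converts the example inputs to fake tensors under the given
# mode, runs the graph, and stores each node's fake output in
# node.meta["val"].
FakeTensorProp(gm, mode=FakeTensorMode()).propagate(torch.randn(4, 4))

for node in gm.graph.nodes:
    print(node.op, node.name, node.meta.get("val"))
```

The diff above applies the same idea per subgraph: it collects the operands' `meta["val"]` entries as example inputs and re-runs propagation on the subgraph module itself.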
@pytorchbot merge -i

Merge started. Your change will be merged while ignoring the following 3 checks: Lint / Link checks / Lint URLs / linux-job, inductor / cuda12.6-py3.10-gcc9-sm86 / test (inductor_torchbench, 2, 2, ephemeral.linux.g5.4xlarge.nvidia.gpu), inductor / cuda12.6-py3.10-gcc9-sm86 / test (inductor_torchbench, 1, 2, ephemeral.linux.g5.4xlarge.nvidia.gpu). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

Merge failed. Reason: 1 job has failed; the first few are: inductor / unit-test / cuda12.6-py3.10-gcc9-sm86 / test (inductor_cpp_wrapper, 1, 2, ephemeral.linux.g5.4xlarge.nvidia.gpu). Details for the Dev Infra team: raised by workflow job.

@pytorchbot merge -i

Merge started. Your change will be merged while ignoring the following 5 checks: Lint / Link checks / Lint URLs / linux-job, inductor / unit-test / cuda12.6-py3.10-gcc9-sm86 / test (inductor_cpp_wrapper, 1, 2, ephemeral.linux.g5.4xlarge.nvidia.gpu), inductor / cuda12.6-py3.10-gcc9-sm86 / test (inductor_torchbench, 2, 2, ephemeral.linux.g5.4xlarge.nvidia.gpu), inductor / cuda12.6-py3.10-gcc9-sm86 / test (inductor_torchbench, 1, 2, ephemeral.linux.g5.4xlarge.nvidia.gpu), trunk / macos-py3-arm64 / test (default, 2, 3, macos-m1-stable). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
```python
    subgraph_names: OrderedSet[str] = OrderedSet(
        x.target for x in gm.graph.find_nodes(op="get_attr")
    )

    for child_name, child_mod in gm.named_children():
        if child_name in subgraph_names and isinstance(child_mod, torch.fx.GraphModule):
            view_to_reshape(child_mod)
```
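A hypothetical end-to-end check (not from the PR's test suite) that the recursion reaches subgraphs. The import path for view_to_reshape is assumed, and the invoke_subgraph call is schematic: the node is only recorded in the graph, never executed, so the exact HOP signature does not affect this sketch.

```python
import torch

# Assumed location of the pass; it is defined with the post_grad passes
# even though it is invoked outside of them.
from torch._inductor.fx_passes.post_grad import view_to_reshape

# Child graph containing an aten.view call.
child = torch.fx.symbolic_trace(lambda x: torch.ops.aten.view.default(x, [4]))

# Parent module referencing the child via get_attr, the shape that
# invoke_subgraph produces. The HOP call below is schematic: the node is
# recorded in the graph but never run.
root = torch.nn.Module()
root.sub = child
g = torch.fx.Graph()
x = g.placeholder("x")
sub = g.get_attr("sub")
out = g.call_function(torch.ops.higher_order.invoke_subgraph, (sub, "subgraph_0", x))
g.output(out)
parent = torch.fx.GraphModule(root, g)

view_to_reshape(parent)

# With this PR, the pass recursed into the child: its view became reshape.
assert all(
    n.target is not torch.ops.aten.view.default for n in child.graph.nodes
)
```

Filtering named_children() by the get_attr targets keeps the recursion limited to subgraph modules actually referenced from the graph, rather than every submodule on the parent.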
I would prefer some automated way of doing this, e.g. GraphTransformObserver accepts view_to_reshape and then automatically applies the pass to subgraphs. How many more times do we need to do this manually?
Stack from ghstack (oldest at bottom):
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov