[cudagraphs] Fix issue in collecting static_input_idxs #152287

anijain2305 · 2025-04-28T01:05:12Z

Stack from ghstack (oldest at bottom):

-> [cudagraphs] Fix issue in collecting static_input_idxs #152287

related to #152275

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

[ghstack-poisoned]

pytorch-bot · 2025-04-28T01:05:16Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152287

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8ccbcc7 with merge base 8e2e06b ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 11ea1ee Pull Request resolved: #152287

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: d871635 Pull Request resolved: #152287

test/dynamo/test_subclasses.py

eellison

Thanks for looking into this

torch/_inductor/compile_fx.py

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: 2fb99ba Pull Request resolved: #152287

bdhirsh

sgtm

bdhirsh · 2025-04-28T17:10:10Z

test/dynamo/test_subclasses.py

@@ -2135,9 +2135,9 @@ def inner_compile(
            extern_node_serializer: Optional[Callable[[list[Any]], Any]] = None,
        ):
            if dynamic:
-                self.assertEqual(static_input_idxs, [0, 1, 2, 3, 4])
+                self.assertEqual(static_input_idxs, [2, 3, 4])


This test looks strictly more correct now than previously. For sanity, this is the signature of the AOT graph in this test:

def forward(self, arg0_1: "Sym(s25)", arg1_1: "f32[s25][1]cpu", arg2_1: "f32[s25][1]cpu", arg3_1: "f32[s25][1]cpu", arg4_1: "Sym(s25)", arg5_1: "f32[s25][1]cpu"):

Where indices [2,3] correspond to the two static tensor inputs that mapped to the static TwoTensor subclass.

One thing that is wrong in this test though is that:

(1) in the dynamic shapes variant of this test, we have extra SymInt graph args that correspond to the symbolic sizes of the subclass

(2) we are marking those inputs as static indices as well, which is happening here: https://github.com/pytorch/pytorch/blob/main/torch/_functorch/_aot_autograd/subclass_utils.py#L308

This seems wrong. It might turn out not cause too many problems, if inductor has logic to properly filter out SymInts from the "static input indices" list later (given that integers have no memory address and get burned into cudagraphs anyway). But we should probably fix it either way. cc @mlazos

This makes sense, I can take a look at the issue.

mlazos · 2025-04-28T17:38:47Z

torch/_functorch/aot_autograd.py

@@ -1054,7 +1055,11 @@ def _try_get_metadata_from_dynamo(
            static_inputs_log.debug(
                "Adding static input pos %s for source %s", pos, source_name


should we update this log call as well?

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: be5355e Pull Request resolved: #152287

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: 8b24e4f Pull Request resolved: #152287

anijain2305 · 2025-04-28T22:02:45Z

@pytorchbot merge

pytorchmergebot · 2025-04-28T22:04:39Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-04-29T06:56:56Z

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot · 2025-04-29T06:57:09Z

@anijain2305 your PR has been successfully reverted.

…)" This reverts commit 75a5646. Reverted #152287 on behalf of https://github.com/wdvr due to causing ao failures - discussed with author ([comment](#152287 (comment)))

gante · 2025-04-29T10:34:23Z

@anijain2305 thank you for the quick bugfix!

I've applied the changes in torch/_functorch/aot_autograd.py and torch/_inductor/compile_fx.py over the base torch 2.7, and I can confirm it solves the issue ✅

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: 8e2be4a Pull Request resolved: #152287

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

ghstack-source-id: 977528d Pull Request resolved: #152287

bdhirsh · 2025-04-29T22:53:04Z

torch/_inductor/freezing.py

    # add on non param inputs
    preserved_arg_indices.extend(range(len(flat_params), len(params)))
    # is this necessary ?
+    fw_metadata.static_input_indices = static_indices_new


these are the new changes to update the static_input_indices list under freezing. As @eellison pointed out, it might be a good idea to stash this info on the graph placeholders directly in the future, so we don't need to worry about updating this list after ever calling convention change in inductor

Thanks these should be easy to merge in 2.7

anijain2305 · 2025-04-30T00:39:54Z

@pytorchbot merge

pytorchmergebot · 2025-04-30T00:42:49Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

anijain2305 · 2025-04-30T03:21:57Z

@pytorchbot merge -f "stuck merge"

pytorchmergebot · 2025-04-30T03:22:15Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

pytorchmergebot · 2025-04-30T03:23:53Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

anijain2305 · 2025-05-04T00:39:35Z

@pytorchbot cherry-pick --onto release/2.7 -c critical

related to #152275 Pull Request resolved: #152287 Approved by: https://github.com/bdhirsh, https://github.com/eellison Co-authored-by: Brian Hirsh <hirsheybar@fb.com> (cherry picked from commit 4a63cab)

pytorchbot · 2025-05-04T00:44:54Z

Cherry picking #152287

The cherry pick PR is at #152768 and it is recommended to link a critical cherry pick PR with an issue. The following tracker issues are updated:

[v2.7.1] Release Tracker #152627 (comment)

Details for Dev Infra team

Raised by workflow job

[cudagraphs] Fix issue in collecting static_input_idxs (#152287) related to #152275 Pull Request resolved: #152287 Approved by: https://github.com/bdhirsh, https://github.com/eellison (cherry picked from commit 4a63cab) Co-authored-by: Brian Hirsh <hirsheybar@fb.com>

[cudagraphs] Fix issue in collecting static_input_idxs

74bd4b0

[ghstack-poisoned]

pytorch-bot bot added ciflow/inductor module: inductor labels Apr 28, 2025

anijain2305 added a commit that referenced this pull request Apr 28, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

e5a37f7

ghstack-source-id: 11ea1ee Pull Request resolved: #152287

anijain2305 added ciflow/trunk Trigger trunk jobs on your pull request topic: not user facing topic category labels Apr 28, 2025

Update on "[cudagraphs] Fix issue in collecting static_input_idxs"

48fe39a

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

pytorch-bot bot added the module: dynamo label Apr 28, 2025

anijain2305 added a commit that referenced this pull request Apr 28, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

a6f2f49

ghstack-source-id: d871635 Pull Request resolved: #152287

anijain2305 commented Apr 28, 2025

View reviewed changes

test/dynamo/test_subclasses.py Outdated Show resolved Hide resolved

eellison reviewed Apr 28, 2025

View reviewed changes

torch/_inductor/compile_fx.py Outdated Show resolved Hide resolved

Update on "[cudagraphs] Fix issue in collecting static_input_idxs"

0ec01c6

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

anijain2305 added a commit that referenced this pull request Apr 28, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

82f90e8

ghstack-source-id: 2fb99ba Pull Request resolved: #152287

bdhirsh approved these changes Apr 28, 2025

View reviewed changes

bdhirsh reviewed Apr 28, 2025

View reviewed changes

mlazos reviewed Apr 28, 2025

View reviewed changes

Update on "[cudagraphs] Fix issue in collecting static_input_idxs"

8e3ed46

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

anijain2305 added a commit that referenced this pull request Apr 28, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

6ce63aa

ghstack-source-id: be5355e Pull Request resolved: #152287

Update on "[cudagraphs] Fix issue in collecting static_input_idxs"

f0f3526

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

anijain2305 added a commit that referenced this pull request Apr 28, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

ed54e12

ghstack-source-id: 8b24e4f Pull Request resolved: #152287

anijain2305 requested review from eellison and mlazos April 28, 2025 20:58

eellison approved these changes Apr 28, 2025

View reviewed changes

pytorchmergebot added the merging label Apr 28, 2025

anijain2305 mentioned this pull request Apr 28, 2025

Support for B200 (sm_100 with pytorch>=2.7.0) huggingface/transformers#37824

Open

pytorchmergebot added the Merged label Apr 28, 2025

pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Apr 29, 2025

pytorchmergebot reopened this Apr 29, 2025

Update on "[cudagraphs] Fix issue in collecting static_input_idxs"

a592e8c

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

anijain2305 added a commit that referenced this pull request Apr 29, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

4cac7c0

ghstack-source-id: 8e2be4a Pull Request resolved: #152287

Update on "[cudagraphs] Fix issue in collecting static_input_idxs"

8ccbcc7

related to #152275 cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]

bdhirsh added a commit that referenced this pull request Apr 29, 2025

[cudagraphs] Fix issue in collecting static_input_idxs

86e07c1

ghstack-source-id: 977528d Pull Request resolved: #152287

bdhirsh reviewed Apr 29, 2025

View reviewed changes

pytorchmergebot added the merging label Apr 30, 2025

anijain2305 removed the Merged label Apr 30, 2025

pytorchmergebot added the Merged label Apr 30, 2025

pytorchmergebot closed this in 4a63cab Apr 30, 2025

pytorchmergebot removed the merging label Apr 30, 2025

pytorchbot mentioned this pull request May 4, 2025

[cudagraphs] Fix issue in collecting static_input_idxs #152768

Merged

pytorchbot mentioned this pull request May 4, 2025

[v2.7.1] Release Tracker #152627

Open

malfet added this to the 2.7.1 milestone May 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[cudagraphs] Fix issue in collecting static_input_idxs #152287

[cudagraphs] Fix issue in collecting static_input_idxs #152287

		@@ -1054,7 +1055,11 @@ def _try_get_metadata_from_dynamo(
		static_inputs_log.debug(
		"Adding static input pos %s for source %s", pos, source_name

[cudagraphs] Fix issue in collecting static_input_idxs #152287

[cudagraphs] Fix issue in collecting static_input_idxs #152287

Conversation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152287

✅ No Failures

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Merge started

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Merge started

Merge started

Cherry picking #152287