[HOP] Mutation and alias rework #146658
base: main
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/146658
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: there is 1 currently active SEV; if your PR is affected, please view it below.
✅ No Failures as of commit 55b0301 with merge base 084c4aa.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label "topic: not user facing"
…loop, cond and map
@ydwu4 I reworked the mutation and alias checks. I moved the checks into dynamo for scan, associative_scan, while_loop and cond. For map I also included the new check, but since it does not yet use backend='eager', I did not move the check to dynamo. Please let me know what you think.
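For illustration, a minimal sketch of the kind of program the dynamo-level check is meant to reject; the exact error type and message are an assumption here, the point is that the mutation is reported while the branch is being traced:

```python
import torch

def true_fn(x):
    x.add_(1)          # in-place mutation of a branch input
    return x.sin()

def false_fn(x):
    return x.cos()

@torch.compile(backend="eager", fullgraph=True)
def f(pred, x):
    return torch.cond(pred, true_fn, false_fn, (x,))

# With the checks moved into dynamo, the mutation in true_fn should be
# flagged during tracing, before any backend runs.
try:
    f(torch.tensor(True), torch.randn(3))
except Exception as e:
    print(type(e).__name__, e)
```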
Looks good overall. Left a few minor comments.
# This case is for SymInts and other non-Tensor elements
inputs_fake.append(val)
else:
# This case is for ints
can assert they're ints?
Done.
I was wondering, though, whether we could improve this in general. Are you fine with how we currently collect the fake inputs?
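As a rough sketch (not the actual dynamo helper; `collect_fake_inputs_sketch` is a hypothetical name), the case split with the added assert might look like this:

```python
import torch
from torch._subclasses.fake_tensor import FakeTensor, FakeTensorMode

def collect_fake_inputs_sketch(example_values):
    inputs_fake = []
    for val in example_values:
        if isinstance(val, (FakeTensor, torch.SymInt)):
            # FakeTensors and SymInts can be collected as-is
            inputs_fake.append(val)
        else:
            # Per the review comment: everything else must be a plain int
            assert isinstance(val, int), f"expected int, got {type(val)}"
            inputs_fake.append(val)
    return inputs_fake

with FakeTensorMode():
    fake = torch.empty(3)          # a FakeTensor under this mode
print(collect_fake_inputs_sketch([fake, 4]))
```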
_maybe_reenter_make_fx,
autograd_not_implemented,
# check_input_mutation,
remove?
Sure, I updated the PR and have now reworked pretty much all HOPs.
torch/_higher_order_ops/scan.py
Outdated
@@ -442,27 +439,6 @@ def scan_functionalize(ctx, combine_fn, init, xs, additional_inputs):
    unwrapped_additional_inputs = ctx.unwrap_tensors(additional_inputs)
    with ctx.redispatch_to_next():
        functional_combine_fn = ctx.functionalize(combine_fn)
We might still want to keep the check in the functionalization key, in case someone is using the HOP directly, which could bypass the dynamo checks.
Right, as discussed offline, I reintroduced the checks into the functionalization key as well. If the HOP uses backend='eager', we now have the check both in the functionalization key and in dynamo.
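To illustrate the mechanism the functionalization-key guard relies on (this is not the actual scan_functionalize code, just a minimal sketch): under functionalization, an input mutation in the subgraph is replayed onto the input as a trailing copy_, which a check at that key can detect even when the HOP is invoked directly and dynamo never sees the program.

```python
import torch
from torch.fx.experimental.proxy_tensor import make_fx
from torch.func import functionalize

def combine_fn(carry, x):
    carry.add_(x)                  # mutates a subgraph input
    return carry * 1.0, carry + x

# Tracing the functionalized combine_fn exposes the mutation as an explicit
# copy_ node writing back into the carry placeholder.
gm = make_fx(functionalize(combine_fn))(torch.zeros(3), torch.ones(3))
print(gm.graph)  # expect an aten.copy_ back into the carry input
```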
torch/_higher_order_ops/utils.py
Outdated
inp_out_alias_map,
out_out_alias_map)

def has_potential_input_mutation_or_alias(gm, inputs, pre_dispatch=False): |
nit: we should probably still name this has_potential_input_alias_or_mutation? The name change seems unnecessary.
I agree, I corrected it.
In fact, I also revised the name of the inner helper potential_input_alias_or_mutation, and as a nit adjusted the return values: as the name suggests, the aliases are now returned first, followed by the mutations.
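A small usage sketch of the renamed helper; the import path and the way inputs are passed here are assumptions based on this thread, not the final API:

```python
import torch
from torch.fx.experimental.proxy_tensor import make_fx
from torch._higher_order_ops.utils import has_potential_input_alias_or_mutation

def branch(x, y):
    return x.view(-1), x + y   # first output aliases the input x

x, y = torch.randn(3), torch.randn(3)
gm = make_fx(branch)(x, y)

# Expected to flag the graph because an output aliases one of its inputs.
print(has_potential_input_alias_or_mutation(gm, [x, y]))
```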
# TODO: This is an unexpected behavior for cond
# Without this additional multiplication,
# the output of the backward graph would alias the
# inputs, as the gradients are just 1s and thus get optimized
def true_fn(x):
-    return x["t"][0] + x["t"][1]["b"] * x["t"][2][0]
+    return (x["t"][0] * 2.0) + x["t"][1]["b"] * x["t"][2][0]
This may be a bit unexpected to the user. Currently we don't allow aliases, including input-output aliases. This is problematic because the gradients can be just 1s, in which case the upstream gradients (the arguments) are simply passed along, which then triggers the alias checks.
A naive solution would be to disable the input-output alias check, but I am not sure whether that causes problems.
Is there another solution to this?
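A small eager illustration of the issue described above: for a plain addition the backward just forwards the upstream gradient, so the "computed" gradient can share storage with (or literally be) the tensor that was passed in. When that backward is captured as a graph for a HOP, it becomes an output aliasing an input, which the alias check rejects; scaling by 2.0 forces a fresh tensor.

```python
import torch

x = torch.randn(3, requires_grad=True)
y = torch.randn(3, requires_grad=True)

out = x + y
g_up = torch.ones_like(out)                      # upstream gradient
gx, gy = torch.autograd.grad(out, (x, y), g_up)
print(gx.data_ptr() == g_up.data_ptr())          # often True: gradient aliases g_up

out2 = x * 2.0 + y                               # the workaround from the test
g2x, _ = torch.autograd.grad(out2, (x, y), g_up)
print(g2x.data_ptr() == g_up.data_ptr())         # False: the multiply produces a new tensor
```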
A better way is to properly support it through auto_functionalized. This is still WIP though.
Tests are failing. Added a few comments
input_storages: dict[StorageWeakRef, torch.fx.Node] = dict()

for node in self.graph.nodes:
    if node.op == "placeholder":
-        example_value = node.meta["example_value"]
+        example_value = _collect_fake_inputs([node])[0]
what happens here? I don't expect it will error, will it?
No, it does error out. In some test cases the example value is a BatchedTensorImpl, for which self.untyped_storage() doesn't exist.
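For context, a tiny illustration of why the raw example value can't be used for the storage lookup. _add_batch_dim is an internal functorch helper, used here only to construct such a tensor; treat this snippet as an assumption about the failing case rather than the actual test:

```python
import torch
from torch._C._functorch import _add_batch_dim

# A functorch BatchedTensorImpl wraps another tensor and exposes no storage of
# its own, so untyped_storage() raises instead of returning something we could
# take a StorageWeakRef to.
t = torch.randn(4, 3)
bt = _add_batch_dim(t, 0, 1)   # wrap t as batched over dim 0 at vmap level 1
try:
    bt.untyped_storage()
except RuntimeError as e:
    print("untyped_storage() failed:", e)
```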
@@ -1336,6 +1340,8 @@ def create_unbacked_sym_node_var(tx) -> SymNodeVariable:
    source_target=self.value,
    set_subgraph_inputs="flatten_manual",
    should_flatten_outputs=True,
+   supports_input_mutation=False,
Can we put these two as class fields, like BaseHOPVariable?
Done.
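A sketch of the suggestion above: instead of threading the flags through every call, declare them once as class attributes on the HOP's VariableTracker, the way BaseHOPVariable does. The class names and the second flag (supports_aliasing) below are illustrative stand-ins, not the exact dynamo classes:

```python
class TorchHigherOrderOperatorVariable:           # stand-in for the dynamo base class
    supports_input_mutation: bool = True
    supports_aliasing: bool = True

class CondHigherOrderVariable(TorchHigherOrderOperatorVariable):
    # cond branches may neither mutate nor alias their inputs
    supports_input_mutation = False
    supports_aliasing = False

# Call sites can then read the policy from the class instead of passing kwargs:
print(CondHigherOrderVariable.supports_input_mutation)  # False
```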
@@ -699,17 +741,19 @@ def validate_subgraph_args_types(lifted_args: Union[tuple[Any, ...], list[Any]])
    ), f"{lifted_args} can only be of {allowed_types} but got {tuple(type(arg) for arg in lifted_args)}"


# TODO: Return more detailed information as to which node
# causes a mutation or an alias. This may require per-operator tensor version checking.
def check_input_alias_and_mutation(
Why do we need to move mutated_inputs to the end?
Well, I just moved it to be consistent with the function name: it says alias and mutation, so the aliases come first and the mutations go at the end. WDYT?
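For clarity, this is the unpacking order implied by the change as it has to appear at every call site. The names follow the diff above; gm and fake_inputs stand for a traced subgraph and its inputs, so this is an illustrative sketch rather than runnable code:

```python
# New ordering: the alias maps come first, the mutated inputs last.
(
    inp_inp_alias_map,   # input index  -> input index it aliases
    inp_out_alias_map,   # input index  -> output index it aliases
    out_out_alias_map,   # output index -> output index it aliases
    mutated_inputs,      # inputs that are mutated inside the subgraph
) = check_input_alias_and_mutation(gm, fake_inputs)
```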
The change looks good. Not sure why the tests started failing.
I think I found the issue. There was one place in inductor where I missed the rearrangement of the return values from alias_mutation. Fingers crossed for this time.
This PR reworks the way input mutations and aliases are checked.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @ydwu4