[dynamo][optimizers] Install ID_GUARDED tensors into the Fx graph #147824
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/147824
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 3 Pending as of commit 317a08d with merge base ce805a5.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
…x graph" cc voznesenskym 8000 penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames [ghstack-poisoned]
@@ -358,7 +358,7 @@ def wrap_tensor(self, tx: "InstructionTranslator", tensor_value):
     # mark these tensors as static for cudagraphs
     mark_static_address(tensor_value)
     source = self.tensor_to_source[tensor_value]
-    self.static_tensor_names.add(tx.output.module_key_name(source.name))
+    self.static_tensor_names.add(tx.output.module_key_name(source.name()))
pre-existing bug.
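For context, a minimal standalone sketch of why the missing call mattered (using a hypothetical `FakeSource` class for illustration, not dynamo's actual `Source`): without the parentheses, the bound-method object itself ends up in the set instead of the name string it returns.

```python
class FakeSource:
    """Hypothetical stand-in for a dynamo Source whose name() returns a string."""

    def name(self):
        return "L['self'].param"


static_tensor_names = set()
src = FakeSource()

static_tensor_names.add(src.name)    # adds a bound-method object (the old bug)
static_tensor_names.add(src.name())  # adds the string "L['self'].param"

print(static_tensor_names)  # {<bound method FakeSource.name ...>, "L['self'].param"}
```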
…x graph" cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames [ghstack-poisoned]
…x graph" cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]
…x graph" cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]
…x graph" cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]
…x graph" cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]
torch/_dynamo/output_graph.py (Outdated)
for node in reversed(list(self.graph.nodes)):
    if node.op == "get_attr" and len(list(node.users)) == 0:
Suggested change:
-for node in reversed(list(self.graph.nodes)):
-    if node.op == "get_attr" and len(list(node.users)) == 0:
+for node in sorted(self.graph.find_nodes(op="get_attr"), reverse=True):
+    if len(list(node.users)) == 0:
So we don't need to take a pass over the entire graph.
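A rough, self-contained sketch of the suggested pattern (the toy module and the manually inserted dangling node are illustrative assumptions, not the actual output_graph.py code): use `Graph.find_nodes` to visit only the get_attr nodes and erase the unused ones, instead of scanning every node in the graph.

```python
import torch
import torch.fx as fx


class Toy(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.w = torch.nn.Parameter(torch.randn(4))

    def forward(self, x):
        return x + self.w


gm = fx.symbolic_trace(Toy())

# Simulate a dangling get_attr node, like one left behind for an installed
# tensor that ends up unused.
output_node = next(n for n in gm.graph.nodes if n.op == "output")
with gm.graph.inserting_before(output_node):
    gm.graph.get_attr("w")

# Visit only get_attr nodes; erase those with no users.
for node in list(gm.graph.find_nodes(op="get_attr")):
    if len(node.users) == 0:
        gm.graph.erase_node(node)

gm.recompile()
print(gm.graph)  # the dangling get_attr node is gone; the used one remains
```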
Not fixed?
Co-authored-by: Jason Ansel <jansel@meta.com>
@pytorchbot rebase
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here.
Successfully rebased.
torch/_dynamo/output_graph.py (Outdated)
for node in reversed(list(self.graph.nodes)):
    if node.op == "get_attr" and len(list(node.users)) == 0:
Not fixed?
…x graph" Earlier, with inline flag we were lifting id-guarded tensors to the inputs to the Fx graph. But this offers no benefit. Main idea behind lifting parameters as inputs was to reuse the compilation units across many instances of the nn-module. However, if we are guarding on the `id`, we are explicitly specializing the compiled artifact to the parameter. This PR installs the parameters back into the graph. The benefit is removal of all pre-graph bytecode to extract the id-guarded tensors from locals/globals. This increases speedup from 1.67x to 1.75x for an internal model that has large number of optimizer parameters. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 amjames chauhang aakhundov [ghstack-poisoned]
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge -f
❌ 🤖 pytorchbot command failed:
Try …
@pytorchbot merge -f "stuck CI" |
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
[dynamo][optimizers] Install ID_GUARDED tensors into the Fx graph (pytorch#147824)

Earlier, with the inlining flag, we lifted id-guarded tensors as inputs to the Fx graph, but this offers no benefit. The main idea behind lifting parameters as inputs was to reuse compilation units across many instances of an nn.Module; however, if we are guarding on the `id`, we are explicitly specializing the compiled artifact to the parameter. This PR installs the parameters back into the graph. The benefit is the removal of all pre-graph bytecode that extracts the id-guarded tensors from locals/globals, which increases the speedup from 1.67x to 1.75x on an internal model with a large number of optimizer parameters.

Pull Request resolved: pytorch#147824
Approved by: https://github.com/jansel
Co-authored-by: Jason Ansel <jansel@meta.com>
This could be related to #152275.
Stack from ghstack (oldest at bottom):
Earlier, with the inlining flag, we lifted id-guarded tensors as inputs to the Fx graph, but this offers no benefit. The main idea behind lifting parameters as inputs was to reuse compilation units across many instances of an nn.Module; however, if we are guarding on the `id`, we are explicitly specializing the compiled artifact to the parameter.

This PR installs the parameters back into the graph. The benefit is the removal of all pre-graph bytecode that extracts the id-guarded tensors from locals/globals, which increases the speedup from 1.67x to 1.75x on an internal model with a large number of optimizer parameters.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov
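As a rough illustration of the difference (hand-built FX graphs with illustrative names, not dynamo's actual codegen): when a tensor is lifted as a graph input, the caller has to fetch it and pass it in on every call, which is what the pre-graph bytecode used to do; when it is installed into the graph, a get_attr node reads it from the module, and the compiled artifact is specialized to that exact tensor, which the id guard already implies.

```python
import torch
import torch.fx as fx

param = torch.nn.Parameter(torch.randn(4))
x_in = torch.randn(4)

# (a) Lifted as an input: the caller must pass the tensor explicitly.
g_lifted = fx.Graph()
x = g_lifted.placeholder("x")
p = g_lifted.placeholder("p")
g_lifted.output(g_lifted.call_function(torch.add, (x, p)))
gm_lifted = fx.GraphModule(torch.nn.Module(), g_lifted)
out_lifted = gm_lifted(x_in, param)

# (b) Installed into the graph: the tensor lives on the module and is read
# via get_attr, so no extra argument (and no pre-graph bytecode) is needed.
root = torch.nn.Module()
root.register_parameter("p", param)
g_installed = fx.Graph()
x2 = g_installed.placeholder("x")
p2 = g_installed.get_attr("p")
g_installed.output(g_installed.call_function(torch.add, (x2, p2)))
gm_installed = fx.GraphModule(root, g_installed)
out_installed = gm_installed(x_in)

assert torch.equal(out_lifted, out_installed)
```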