Shard RegisterDispatchKey by swolchok · Pull Request #144364 · pytorch/pytorch · GitHub

Closed

wants to merge 8 commits
Conversation

swolchok
Contributor
@swolchok swolchok commented Jan 8, 2025

Stack from ghstack (oldest at bottom):

Should fix #143952.

Testing: built PyTorch on Raspberry Pi 5; this seemed to alleviate the high peak memory requirement. (I did increase shard counts for other generated files along the way, but I need to go back and figure out how much of that was strictly necessary vs. needing to use -j1 or -j2.)

Differential Revision: D67925496
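For context, the idea behind sharding a generated file like RegisterDispatchKey is to split one very large generated .cpp into several smaller shards (e.g. RegisterCPU_0.cpp, RegisterCPU_1.cpp, ...), so that no single compiler invocation needs as much peak memory. A minimal sketch of that idea, with illustrative names that are not the actual torchgen implementation:

```python
# Illustrative sketch of output sharding: distribute generated kernel
# registrations across N shard files so no single translation unit is huge.
# shard() and sharded_filenames() are hypothetical helpers, not torchgen code.

def shard(items, num_shards):
    """Distribute items round-robin across num_shards buckets."""
    buckets = [[] for _ in range(num_shards)]
    for i, item in enumerate(items):
        buckets[i % num_shards].append(item)
    return buckets

def sharded_filenames(base, num_shards):
    # e.g. "RegisterCPU.cpp" -> "RegisterCPU_0.cpp", "RegisterCPU_1.cpp", ...
    stem, ext = base.rsplit(".", 1)
    return [f"{stem}_{i}.{ext}" for i in range(num_shards)]

registrations = [f"kernel_{i}" for i in range(10)]
for name, bucket in zip(sharded_filenames("RegisterCPU.cpp", 3),
                        shard(registrations, 3)):
    print(name, len(bucket))
```

Each shard then compiles independently, which also lets the build parallelize the shards across jobs.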

pytorch-bot bot commented Jan 8, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/144364

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 40388ae with merge base bc57635:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

swolchok added a commit that referenced this pull request Jan 8, 2025

ghstack-source-id: 260579275
Pull Request resolved: #144364
@swolchok swolchok added the release notes: build release notes category label Jan 8, 2025
@swolchok
Contributor Author
swolchok commented Jan 8, 2025

whoops, pushed 7df69db to the wrong PR (it goes in the bottom-of-stack PR), will fix shortly

@swolchok
Contributor Author
swolchok commented Jan 8, 2025

@swolchok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

swolchok added a commit that referenced this pull request Jan 8, 2025
Pull Request resolved: #144364

ghstack-source-id: 260637494
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

@swolchok swolchok requested review from Skylion007 and bdhirsh January 8, 2025 23:30
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jan 9, 2025
-    ":{}[{}]".format(aten_rule_name, "Register" + backend + ".cpp")
+    ":{}[{}]".format(aten_rule_name, "Register" + backend + "_0.cpp")
     for backend in enabled_backends
 ]
Collaborator

While adjusting this, f-string might be a bit cleaner

Contributor Author

Starlark doesn't have f-strings -- bazelbuild/starlark#91
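For illustration, the `.format`-based label construction (which works in Starlark, which lacks f-strings) behaves like the following; `aten_rule_name` and the backend list here are placeholder values, not the real build configuration:

```python
# Starlark-compatible string formatting for sharded codegen output labels.
# aten_rule_name and enabled_backends are placeholder values for illustration.
aten_rule_name = "aten_codegen"
enabled_backends = ["CPU", "CUDA"]

labels = [
    ":{}[{}]".format(aten_rule_name, "Register" + backend + "_0.cpp")
    for backend in enabled_backends
]
print(labels)  # → [':aten_codegen[RegisterCPU_0.cpp]', ':aten_codegen[RegisterCUDA_0.cpp]']
```

The same expression is valid in both Python and Starlark, which is why `.format` is used here instead of an f-string.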

def write_sharded_with_template(
    self,
    filename: str,
    template_fn: str,
Contributor

nit: template_fn -> template_name? (it looks like this is the string name of the input template file, not a function)

Contributor Author

this matches the existing template_fn parameter to write_with_template and substitute_with_template. I agree that it is confusing to overload "fn" to abbreviate both "filename" and "function", and there is a lot of historical precedent for "function", so I will just do a file-wide find-replace.

Contributor Author
@swolchok swolchok Jan 9, 2025

file-wide find-replace

BC-Lint has informed me that I can't do that, so I will keep it template_fn for consistency :\
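For readers unfamiliar with this part of torchgen: a sharded template writer renders one output file per shard from a single input template (the `template_fn` discussed above). A rough, standalone sketch of that shape, with assumed callback parameters that are not the actual torchgen signature:

```python
# Rough sketch of a sharded template writer. The env_for_shard and render
# callbacks are assumptions for illustration, not the torchgen API.
from typing import Callable

def write_sharded_with_template(
    filename: str,
    template_fn: str,  # name of the input template file, not a function
    num_shards: int,
    env_for_shard: Callable[[int], dict],
    render: Callable[[str, dict], str],
) -> list[str]:
    """Render one output file per shard from a single template."""
    stem, ext = filename.rsplit(".", 1)
    outputs = []
    for shard in range(num_shards):
        out_name = f"{stem}_{shard}.{ext}"
        contents = render(template_fn, env_for_shard(shard))
        outputs.append(out_name)
        # a real implementation would write `contents` to disk here
        _ = contents
    return outputs
```

The "_fn" suffix meaning "filename" rather than "function" is exactly the naming ambiguity raised in the review thread above.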

Contributor
@bdhirsh bdhirsh left a comment

sgtm!

swolchok added a commit that referenced this pull request Jan 9, 2025
Pull Request resolved: #144364

ghstack-source-id: 260776837

Differential Revision: [D67925496](https://our.internmc.facebook.com/intern/diff/D67925496/)
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

swolchok added a commit that referenced this pull request Jan 9, 2025
Pull Request resolved: #144364

ghstack-source-id: 260820041

Differential Revision: [D67925496](https://our.internmc.facebook.com/intern/diff/D67925496/)
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D67925496

@swolchok
Contributor Author
swolchok commented Jan 9, 2025

The linux-jammy-py3-clang12-executorch job failed, but it is unstable and failing on main, so I'm ignoring the failure.

@facebook-github-bot
Contributor

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

Labels
ciflow/trunk Trigger trunk jobs on your pull request fb-exported Merged release notes: build release notes category
5 participants