xpu: get xpu arch flags at runtime in cpp_extensions #152192

dvrogozh · 2025-04-25T15:43:39Z

This commit moves the query for XPU arch flags to runtime when building SYCL extensions, which allows TORCH_XPU_ARCH_LIST to be adjusted at the Python script level. That is handy, for example, in a CI test that tries a few variants of the list.
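
To illustrate why the timing of the lookup matters, here is a minimal sketch that does not use PyTorch at all. It assumes the earlier behavior amounted to computing the value once at import time; the helper names (`arch_list_frozen`, `arch_list_at_call_time`, `try_variants`) are hypothetical and the arch names are only illustrative values.

```python
import os

ENV_VAR = "TORCH_XPU_ARCH_LIST"

# Hypothetical stand-in for a value computed once, when the module is imported.
_ARCH_LIST_AT_IMPORT = os.environ.get(ENV_VAR, "")


def arch_list_frozen() -> str:
    """Return the value captured at import time; later env changes are ignored."""
    return _ARCH_LIST_AT_IMPORT


def arch_list_at_call_time() -> str:
    """Read the environment on every call, so a script can change it between builds."""
    return os.environ.get(ENV_VAR, "")


def try_variants(variants):
    """Set each variant in the environment and record what each helper reports."""
    saved = os.environ.get(ENV_VAR)
    seen = []
    try:
        for variant in variants:
            os.environ[ENV_VAR] = variant
            seen.append((variant, arch_list_frozen(), arch_list_at_call_time()))
    finally:
        # Restore the caller's environment.
        if saved is None:
            os.environ.pop(ENV_VAR, None)
        else:
            os.environ[ENV_VAR] = saved
    return seen
```

With the call-time helper, a single test script can loop over several arch lists; with the frozen helper, every iteration would see whatever was set when the module was first imported.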

CC: @malfet, @jingxu10, @EikanWang, @guangyey

cc @gujinghui @EikanWang @fengyuan14 @guangyey

pytorch-bot · 2025-04-25T15:43:43Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152192

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 96f1e66 with merge base 9608e7f:

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-focal-py3_9-clang9-xla / build (gh) (trunk failure)
ninja: build stopped: subcommand failed

This comment was automatically generated by Dr. CI and updates every 15 minutes.

dvrogozh · 2025-04-25T15:44:43Z

@pytorchbot label "topic: not user facing"

dvrogozh · 2025-04-25T15:44:56Z

@pytorchbot label "module: xpu"

dvrogozh · 2025-04-25T15:50:30Z

@pytorchbot label "ciflow/xpu"

pytorch-bot · 2025-04-25T15:50:40Z

To add these label(s) (ciflow/xpu) to the PR, please first approve the workflows that are awaiting approval (scroll to the bottom of this page).

This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.

test/test_cpp_extensions_jit.py

dvrogozh · 2025-04-28T04:24:26Z

The last push to the PR only fixes a linter issue; there are no other changes.

guangyey · 2025-05-08T02:18:20Z

@pytorchbot rebase

pytorchmergebot · 2025-05-08T02:19:52Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-05-08T02:19:55Z

Successfully rebased extension onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout extension && git pull --rebase)

albanD

Sure

albanD · 2025-05-08T16:06:20Z

torch/utils/cpp_extension.py

@@ -291,16 +291,25 @@ def _get_sycl_arch_list():
 # If arch list returned by _get_sycl_arch_list() is empty, then sycl kernels will be compiled
 # for default spir64 target and avoid device specific compilations entirely. Further, kernels
 # will be JIT compiled at runtime.
+def _get_sycl_target_flags():
+    if _get_sycl_arch_list() != '':


Why do you build the full arch list just to check if it's empty?

The user can either specify an empty arch list externally (by setting TORCH_XPU_ARCH_LIST when building their extension), or the default arch list can be empty if that is how the user's PyTorch build was originally produced (with an empty TORCH_XPU_ARCH_LIST at PyTorch build time). Moreover, an empty arch list is still a valid case, since it corresponds to JIT compilation at runtime. This `if` handles the difference in options between the JIT and AOT cases.
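
The branching described above can be sketched as follows. This is a simplified stand-in, not the code from `torch/utils/cpp_extension.py`: the function names and the `default` parameter (standing for the arch list PyTorch itself was built with) are hypothetical, and the `spir64_gen,spir64` target string for the AOT case is an assumption based on the usual DPC++ targets. Only the `spir64` default for the empty list comes from the comment in the diff.

```python
import os
from typing import List


def get_sycl_arch_list(default: str = "") -> str:
    """Return a comma-separated arch list.

    An explicit TORCH_XPU_ARCH_LIST wins, even when it is set to an empty
    string; otherwise fall back to `default` (hypothetical parameter).
    """
    if "TORCH_XPU_ARCH_LIST" in os.environ:
        return os.environ["TORCH_XPU_ARCH_LIST"]
    return default


def get_sycl_target_flags(default: str = "") -> List[str]:
    """Pick compiler target flags depending on whether any arch is requested."""
    if get_sycl_arch_list(default) != "":
        # AOT case: device-specific compilation (target names are an assumption).
        return ["-fsycl-targets=spir64_gen,spir64"]
    # JIT case: generic spir64 only, kernels are compiled at runtime.
    return ["-fsycl-targets=spir64"]
```

The check has to go through the arch list lookup because the list can become empty in two ways: the environment variable is set to an empty string, or the default itself is empty.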

guangyey · 2025-05-09T01:52:20Z

@pytorchbot rebase

pytorchmergebot · 2025-05-09T01:53:52Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

This commit moves the query for XPU arch flags to runtime when building SYCL extensions, which allows `TORCH_XPU_ARCH_LIST` to be adjusted at the Python script level. That is handy, for example, in a CI test that tries a few variants of the list.

Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>

pytorchmergebot · 2025-05-09T01:53:54Z

Successfully rebased extension onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout extension && git pull --rebase)

guangyey · 2025-05-09T01:54:42Z

@pytorchbot merge

pytorchmergebot · 2025-05-09T01:56:58Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

dvrogozh requested review from fmassa, soumith and ezyang as code owners April 25, 2025 15:43

pytorch-bot bot added the topic: not user facing topic category label Apr 25, 2025

pytorch-bot bot added the module: xpu Intel XPU related issues label Apr 25, 2025

pytorchbot added the open source label Apr 25, 2025

dvrogozh mentioned this pull request Apr 25, 2025

[xpu] set aot device flags in cpp_extension #149459

Closed

guangyey approved these changes Apr 28, 2025

guangyey reviewed Apr 28, 2025

guangyey added this to PyTorch Intel Apr 28, 2025

guangyey added the ciflow/xpu Run XPU CI tasks label Apr 28, 2025

guangyey moved this to Review Required in PyTorch Intel Apr 28, 2025

guangyey added the release notes: xpu release notes category label Apr 28, 2025

dvrogozh force-pushed the extension branch from a236a49 to a1d27cb Compare April 28, 2025 04:18

pytorch-bot bot removed the ciflow/xpu Run XPU CI tasks label Apr 28, 2025

guangyey added the ciflow/xpu Run XPU CI tasks label Apr 28, 2025

dvrogozh force-pushed the extension branch from a1d27cb to 5d13a96 Compare April 28, 2025 16:23

pytorch-bot bot removed the ciflow/xpu Run XPU CI tasks label Apr 28, 2025

gujinghui approved these changes Apr 29, 2025

guangyey added keep-going Don't stop on first failure, keep running tests until the end ciflow/xpu Run XPU CI tasks labels Apr 29, 2025

pytorchmergebot force-pushed the extension branch from 5d13a96 to a20caaf Compare May 8, 2025 02:19

pytorch-bot bot removed the ciflow/xpu Run XPU CI tasks label May 8, 2025

guangyey added ciflow/xpu Run XPU CI tasks ciflow/trunk Trigger trunk jobs on your pull request labels May 8, 2025

guangyey requested review from malfet, atalman and albanD May 8, 2025 02:25

albanD approved these changes May 8, 2025

pytorchmergebot force-pushed the extension branch from a20caaf to 96f1e66 Compare May 9, 2025 01:53

pytorch-bot bot removed ciflow/trunk Trigger trunk jobs on your pull request ciflow/xpu Run XPU CI tasks labels May 9, 2025

guangyey added ciflow/xpu Run XPU CI tasks ciflow/trunk Trigger trunk jobs on your pull request labels May 9, 2025

pytorchmergebot added the merging label May 9, 2025

pytorchmergebot added the Merged label May 9, 2025

pytorchmergebot closed this in aca2c99 May 9, 2025

github-project-automation bot moved this from Review Required to Done in PyTorch Intel May 9, 2025

pytorchmergebot removed the merging label May 9, 2025

dvrogozh mentioned this pull request May 9, 2025

[RFC][API-Unstable] Support 3rd party SYCL kernels with CPP Extension API #153265

Open

11 tasks
