Add torch.accelerator.device_index as accelerator's device switch context #148864

guangyey · 2025-03-10T03:43:07Z

Stack from ghstack (oldest at bottom):

Refactor to use torch.accelerator.device_index instead of torch.cuda.device for generic device context manager #148880
-> Add torch.accelerator.device_index as accelerator's device switch context #148864

Motivation

We propose adding support for the Python with statement on torch.accelerator.device_index to enable device switching functionality. This enhancement would simplify writing device-agnostic code and provide benefits across all accelerators. Its device-specific counterparts include torch.cuda.device and torch.cuda._DeviceGuard.

Design Philosophy
It accepts either an Int or None as input. When None is passed, no device switch is performed. Supporting None is important for compatibility, as it's possible to encounter None values from torch.device.index.

Therefore, with this PR, we can do like this

src = 0
dst = 1
# Set src to current device
torch.accelerator.set_device_index(src)
with torch.accelerator.device_index(dst):
    # Inside with statement, we set dst to current device
    assert torch.accelerator.get_device_index() == dst
# Here the current device should be src
assert torch.accelerator.get_device_index() == src

cc @albanD @EikanWang

pytorch-bot · 2025-03-10T03:43:10Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148864

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 1 Unrelated Failure

As of commit b43545c with merge base fc6e37c ():

NEW FAILURES - The following jobs have failed:

xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 1, 4, linux.idc.xpu) (gh)
inductor/test_torchinductor_opinfo.py::TestInductorOpInfoXPU::test_comprehensive__chunk_cat_xpu_bool
xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 3, 4, linux.idc.xpu) (gh)
higher_order_ops/test_invoke_subgraph.py::TestInvokeSubgraphCompile::test_triton_kernel_native
xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 4, 4, linux.idc.xpu) (gh)
inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_device_type_failed_xpu

FLAKY - The following job failed but was likely due to flakiness present on trunk:

xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 2, 4, linux.idc.xpu) (gh) (similar failure)
inductor/test_torchinductor_opinfo.py::TestInductorOpInfoXPU::test_comprehensive__chunk_cat_xpu_float64

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 752b700 Pull Request resolved: #148864

ghstack-source-id: 9994801 Pull Request resolved: #148864

ghstack-source-id: 01a7f2c Pull Request resolved: #148864

[ghstack-poisoned]

guangyey · 2025-03-12T02:14:11Z

@pytorchbot rebase

pytorchmergebot · 2025-03-12T02:15:35Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

[ghstack-poisoned]

pytorchmergebot · 2025-03-12T02:15:46Z

Successfully rebased gh/guangyey/126/orig onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/148864)

ghstack-source-id: b294e55 Pull Request resolved: #148864

guangyey · 2025-03-14T10:30:13Z

@pytorchbot rebase

pytorchmergebot · 2025-03-14T10:31:38Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

[ghstack-poisoned]

pytorchmergebot · 2025-03-14T10:31:48Z

Successfully rebased gh/guangyey/126/orig onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/148864)

ghstack-source-id: d289cde Pull Request resolved: #148864

guangyey · 2025-03-14T10:38:49Z

@albanD May I know if this PR is reasonable for you.

[ghstack-poisoned]

ghstack-source-id: d07fc95 Pull Request resolved: pytorch/pytorch#148864

[ghstack-poisoned]

albanD

Thanks!

[ghstack-poisoned]

pytorchmergebot · 2025-04-25T02:09:39Z

Starting merge as part of PR stack under #148880

[ghstack-poisoned]

pytorchmergebot · 2025-04-25T02:12:33Z

Rebased gh/guangyey/127/orig onto refs/remotes/origin/viable/strict because #148880 was rebased, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/148864)

pytorchmergebot · 2025-04-25T02:19:59Z

Starting merge as part of PR stack under #148880

pytorchmergebot · 2025-04-25T09:39:22Z

Starting merge as part of PR stack under #148880

…device for generic device context manager (#148880) Pull Request resolved: #148880 Approved by: https://github.com/EikanWang, https://github.com/albanD ghstack dependencies: #148864

# Motivation Align #152474, fix the typo on UT for XPU introduced by #148864 Pull Request resolved: #152812 Approved by: https://github.com/EikanWang, https://github.com/Skylion007

guangyey added a commit that referenced this pull request Mar 10, 2025

Support torch.device as accelerator's device switch context

fe0479d

ghstack-source-id: 752b700 Pull Request resolved: #148864

pytorchbot added the open source label Mar 10, 2025

guangyey added ciflow/trunk Trigger trunk jobs on your pull request release notes: python_frontend python frontend release notes category ciflow/xpu Run XPU CI tasks labels Mar 10, 2025

guangyey requested review from EikanWang and gujinghui as code owners March 10, 2025 04:01

guangyey added a commit that referenced this pull request Mar 10, 2025

Support torch.device as accelerator's device switch context

9919e36

ghstack-source-id: 9994801 Pull Request resolved: #148864

guangyey added the ciflow/rocm Trigger "default" config CI on ROCm label Mar 10, 2025

guangyey added a commit that referenced this pull request Mar 10, 2025

Support torch.device as accelerator's device switch context

21536ee

ghstack-source-id: 01a7f2c Pull Request resolved: #148864

guangyey added the module: accelerator Issues related to the shared accelerator API label Mar 10, 2025

guangyey requested a review from albanD March 10, 2025 08:41

guangyey mentioned this pull request Mar 10, 2025

Refactor to use torch.accelerator.device_index instead of torch.cuda.device for generic device context manager #148880

Closed

guangyey added 4 commits March 10, 2025 11:18

Update

959d056

[ghstack-poisoned]

Update

1a410d5

[ghstack-poisoned]

Update

1b49b97

[ghstack-poisoned]

Update

1e42e7c

[ghstack-poisoned]

Update

2a36109

[ghstack-poisoned]

pytorchmergebot pushed a commit that referenced this pull request Mar 12, 2025

Support torch.device as accelerator's device switch context

57413d3

ghstack-source-id: b294e55 Pull Request resolved: #148864

Update

2e2986c

[ghstack-poisoned]

pytorchmergebot pushed a commit that referenced this pull request Mar 14, 2025

Support torch.device as accelerator's device switch context

056285e

ghstack-source-id: d289cde Pull Request resolved: #148864

Update

5087a56

[ghstack-poisoned]

Divigroup-RAP pushed a commit to Divigroup-RAP/PYTORCH that referenced this pull request Apr 22, 2025

Support torch.device as accelerator's device switch context

59185f9

ghstack-source-id: d07fc95 Pull Request resolved: pytorch/pytorch#148864

pytorch-bot bot added the topic: not user facing topic category label Apr 23, 2025

guangyey changed the title ~~Support torch.device as accelerator's device switch context~~ Add torch.accelerator.device_index as accelerator's device switch context Apr 23, 2025

guangyey changed the title ~~Add torch.accelerator.device_index as accelerator's device switch context~~ [WIP] Add torch.accelerator.device_index as accelerator's device switch context Apr 23, 2025

guangyey added 3 commits April 23, 2025 12:59

Update

1d53ded

[ghstack-poisoned]

Update

d27f975

[ghstack-poisoned]

Update

42487d5

[ghstack-poisoned]

albanD approved these changes Apr 23, 2025

View reviewed changes

guangyey added 6 commits April 23, 2025 16:20

Update

effb3fc

[ghstack-poisoned]

Update

16d6080

[ghstack-poisoned]

Update 8000

dad33cc

[ghstack-poisoned]

Update

f62d254

[ghstack-poisoned]

Update

c0829f7

[ghstack-poisoned]

Update

fe60ddc

[ghstack-poisoned]

guangyey changed the title ~~[WIP] Add torch.accelerator.device_index as accelerator's device switch context~~ Add torch.accelerator.device_index as accelerator's device switch context Apr 24, 2025

guangyey added 3 commits April 24, 2025 09:21

Update

c1ee27a

[ghstack-poisoned]

Update

921ce45

[ghstack-poisoned]

Update

6ae0a78

[ghstack-poisoned]

Update

b43545c

[ghstack-poisoned]

pytorchmergebot closed this in 33c75ca Apr 25, 2025

pytorchmergebot added the Merged label Apr 25, 2025

guangyey mentioned this pull request May 5, 2025

Fix typo on test_multi_device_context_manager for XPU #152812

Closed

guangyey mentioned this pull request May 7, 2025

[RFC] A device-agnostic Python runtime API design for stream-based accelerators #128403

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add torch.accelerator.device_index as accelerator's device switch context #148864

Add torch.accelerator.device_index as accelerator's device switch context #148864

Add torch.accelerator.device_index as accelerator's device switch context #148864

Add torch.accelerator.device_index as accelerator's device switch context #148864

Conversation

Motivation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148864

❌ 3 New Failures, 1 Unrelated Failure

Choose a reason for hiding this comment