Generalize poison fork logic for each device backend #144664
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/144664
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures, 1 Cancelled Job, 4 Unrelated Failures as of commit 034efa9 with merge base c93e4b8:
NEW FAILURES - The following jobs have failed.
CANCELLED JOB - The following job was cancelled. Please retry.
FLAKY - The following jobs failed, but likely due to flakiness present on trunk.
BROKEN TRUNK - The following job failed but was already failing on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot successfully started a revert job. Check the current status here.
This reverts commit 83bd0b6. Reverted #144664 on behalf of https://github.com/atalman due to failing internal tests (see comment on #144664).
@guangyey your PR has been successfully reverted.
Getting the following error: RuntimeError: Only one device type can be registered. But now, we have two device types: mtia and cuda
Sorry about that. I see now that in some scenarios, MTIA can be built together with CUDA in a single binary wheel. I’ll remove the assumption that only one device type is allowed to be registered at fork detection.
#ifndef WIN32
  auto& flag = at_fork_once_flags[static_cast<int>(device_type)];
  c10::call_once(flag, [device_type]() {
    static at::DeviceType at_fork_device_type = device_type;
@albanD I see now that in some scenarios, MTIA can be built together with CUDA in a single binary wheel, so I removed the assumption that only one device type is allowed to be registered at fork detection.
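For illustration, here is a minimal standalone sketch of the per-device-type fork-poisoning idea described above: each backend registers its own once_flag, a single pthread_atfork child handler marks every registered device type as being in a bad fork, and nothing restricts the process to a single device type (so MTIA and CUDA can coexist). The names register_fork_poison, poison_registered_devices_in_child, is_in_bad_fork, and the DeviceType enum below are hypothetical stand-ins, not the PR's actual code; only at_fork_once_flags and the #ifndef WIN32 / call_once pattern come from the snippet above.

// Hypothetical sketch (not the PR's implementation): per-device-type fork poisoning.
#include <pthread.h>
#include <sys/wait.h>
#include <unistd.h>
#include <array>
#include <atomic>
#include <cstdio>
#include <mutex>

enum class DeviceType : int { CUDA = 0, MTIA = 1, XPU = 2 };
constexpr int kNumDeviceTypes = 3;

// One registration flag and one "bad fork" flag per device type.
std::array<std::once_flag, kNumDeviceTypes> at_fork_once_flags;
std::array<std::atomic<bool>, kNumDeviceTypes> device_registered{};
std::array<std::atomic<bool>, kNumDeviceTypes> is_in_bad_fork{};

// atfork child handler: poison every device type that registered in the parent,
// since a device runtime generally cannot be reused safely after fork().
void poison_registered_devices_in_child() {
  for (int i = 0; i < kNumDeviceTypes; ++i) {
    if (device_registered[i].load()) {
      is_in_bad_fork[i].store(true);
    }
  }
}

// Each backend calls this during lazy init; registration runs at most once per
// device type, and the process-wide atfork hook is installed on first use.
void register_fork_poison(DeviceType device_type) {
#ifndef WIN32
  auto& flag = at_fork_once_flags[static_cast<int>(device_type)];
  std::call_once(flag, [device_type]() {
    device_registered[static_cast<int>(device_type)].store(true);
    static std::once_flag atfork_installed;
    std::call_once(atfork_installed, []() {
      pthread_atfork(/*prepare=*/nullptr, /*parent=*/nullptr,
                     /*child=*/poison_registered_devices_in_child);
    });
  });
#endif
}

bool in_bad_fork(DeviceType device_type) {
  return is_in_bad_fork[static_cast<int>(device_type)].load();
}

int main() {
  register_fork_poison(DeviceType::CUDA);
  register_fork_poison(DeviceType::MTIA);  // no "only one device type" restriction
  if (fork() == 0) {
    // Child process: both registered backends are now flagged as poisoned.
    std::printf("child: cuda bad fork = %d, mtia bad fork = %d\n",
                (int)in_bad_fork(DeviceType::CUDA), (int)in_bad_fork(DeviceType::MTIA));
    _exit(0);
  }
  wait(nullptr);
  // Parent process: unaffected by the child-side handler.
  std::printf("parent: cuda bad fork = %d\n", (int)in_bad_fork(DeviceType::CUDA));
  return 0;
}

The key design point in this sketch is that the handler installed via pthread_atfork is process-wide and only needs to be installed once, while the per-device once_flags let an arbitrary set of backends opt in independently instead of asserting on a single registered device type.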
Ho ok :(
Thanks for the update!
Sounds good!
Thanks
Merge started. Your change will be merged while ignoring the following 7 checks: rocm / linux-focal-rocm-py3.10 / test (default, 3, 6, linux.rocm.gpu.2), s390x-periodic / linux-manylinux-2_28-py3-cpu-s390x / test (default, 2, 10, linux.s390x), periodic / linux-focal-rocm-py3.10 / test (distributed, 1, 3, linux.rocm.gpu.4, module:rocm, oncall:distributed), xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 1, 4, linux.idc.xpu), xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 2, 4, linux.idc.xpu), xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 3, 4, linux.idc.xpu), xpu / linux-jammy-xpu-2025.0-py3.9 / test (default, 4, 4, linux.idc.xpu). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
# Motivation Generalize the poison_fork code to make it reusable across different devices. Pull Request resolved: pytorch#144664 Approved by: https://github.com/EikanWang, https://github.com/albanD
…#144664)" This reverts commit d86c141. Reverted pytorch#144664 on behalf of https://github.com/atalman due to failing periodic test: python test/test_cpp_extensions_mtia_backend.py TestCppExtensionMTIABackend.test_device_context (see comment on pytorch#144664)
# Motivation Generalize the poison_fork code to make it reusable across different devices. Pull Request resolved: pytorch#144664 Approved by: https://github.com/EikanWang, https://github.com/albanD
…#144664)" This reverts commit 83bd0b6. Reverted pytorch#144664 on behalf of https://github.com/atalman due to failing internal tests (see comment on pytorch#144664)
Stack from ghstack (oldest at bottom):
Motivation
Generalize the poison_fork code to make it reusable across different devices.
cc @albanD @EikanWang