Enable lazy cloning in `Tensor.to` between CPU and MPS #150569

kurtamohler · 2025-04-02T20:43:24Z

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot · 2025-04-02T20:43:27Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150569

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 5 New Failures, 1 Unrelated Failure

As of commit 8fd51f9 with merge base 56e1c23 ():

NEW FAILURES - The following jobs have failed:

Lint / lintrunner-clang / linux-job (gh)
>>> Lint for aten/src/ATen/native/mps/operations/BinaryOps.mm:
Lint / lintrunner-noclang / linux-job (gh)
>>> Lint for test/test_mps.py:
Mac MPS / macos-py3-arm64-mps / test (test_mps, 1, 1, macos-m1-13) (gh)
test_nn.py::TestNNDeviceTypeMPS::test_conv_empty_input_mps_complex64
Mac MPS / macos-py3-arm64-mps / test (test_mps, 1, 1, macos-m1-14) (gh)
test_nn.py::TestNNDeviceTypeMPS::test_conv_empty_input_mps_complex64
Mac MPS / macos-py3-arm64-mps / test (test_mps, 1, 1, macos-m2-15) (gh)
test_nn.py::TestNNDeviceTypeMPS::test_conv_empty_input_mps_complex64

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / unstable-linux-focal-cuda12.6-py3.10-gcc11-sm89-xfail / build (gh)
ninja: build stopped: subcommand failed

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: c92ef74 Pull Request resolved: #150569

kurtamohler · 2025-04-02T20:59:16Z

Because of what I mentioned here, right now there is no guarantee that a COW MPS tensor will materialize when it should. I've found some places in the codebase where MPS ops cast the const data pointer to non-const and then mutate the data (for instance, any op that calls getMTLBufferStorage and then writes to the buffer). Those ops will incorrectly write to the underlying data of a COW tensor without materializing. So I think it would be best not to merge this PR until we fix the data pointer const-correctness in the MPS ops.

[ghstack-poisoned]

ghstack-source-id: ac5309c Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: ac5309c Pull Request resolved: pytorch#150569

ghstack-source-id: 057856d Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 902ece7 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 902ece7 Pull Request resolved: pytorch#150569

ghstack-source-id: 3cd33fa Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 3cd33fa Pull Request resolved: pytorch#150569

ghstack-source-id: 6f363f6 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 6f363f6 Pull Request resolved: pytorch#150569

ghstack-source-id: 20c5f02 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 83cfe9e Pull Request resolved: #150569

ghstack-source-id: 6cccde2 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: fa61099 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 65a4c5f Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: ba67420 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: e508c76 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: 6c1a950 Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: f382e2f Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: b2facee Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: a6c604a Pull Request resolved: #150569

[ghstack-poisoned]

ghstack-source-id: e5f0395 Pull Request resolved: #150569

cyyever · 2025-05-01T06:15:50Z

Is it possible to extend the support to other kinds of devices?

[ghstack-poisoned]

ghstack-source-id: 0dd5075 Pull Request resolved: #150569

kurtamohler · 2025-05-01T17:30:21Z

Is it possible to extend the support to other kinds of devices?

In general probably not, but the answer depends on the platform. The reason this can work for CPU-MPS is that M-series Macs have a shared memory space that both the cpu and gpu can access

[ghstack-poisoned]

ghstack-source-id: 738d0b6 Pull Request resolved: #150569

ghstack-source-id: 0dd5075 Pull Request resolved: pytorch#150569

[ghstack-poisoned]

ghstack-source-id: 79b47df Pull Request resolved: #150569

Update

c63231c

[ghstack-poisoned]

kurtamohler mentioned this pull request Apr 2, 2025

Enable _lazy_clone between CPU and MPS #148408

Open

kurtamohler added a commit that referenced this pull request Apr 2, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

1d8798e

ghstack-source-id: c92ef74 Pull Request resolved: #150569

pytorchbot added the open source label Apr 2, 2025

kurtamohler added release notes: lazy release notes category release notes: mps Release notes category and removed open source labels Apr 2, 2025

kurtamohler marked this pull request as draft April 2, 2025 21:00

pytorchbot added the open source label Apr 2, 2025

Update

a2f60c3

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 3, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

7ee9da0

ghstack-source-id: ac5309c Pull Request resolved: #150569

Update

825d03b

[ghstack-poisoned]

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request Apr 3, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

a907427

ghstack-source-id: ac5309c Pull Request resolved: pytorch#150569

kurtamohler added a commit that referenced this pull request Apr 3, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

cf61305

ghstack-source-id: 057856d Pull Request resolved: #150569

Update

ddccfba

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

6f78926

ghstack-source-id: 902ece7 Pull Request resolved: #150569

Update

f926e4e

[ghstack-poisoned]

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

aef97bf

ghstack-source-id: 902ece7 Pull Request resolved: pytorch#150569

kurtamohler added a commit that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

531e6a0

ghstack-source-id: 3cd33fa Pull Request resolved: #150569

Update

f06b57e

[ghstack-poisoned]

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

daeb6c8

ghstack-source-id: 3cd33fa Pull Request resolved: pytorch#150569

kurtamohler added a commit that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

cb1272b

ghstack-source-id: 6f363f6 Pull Request resolved: #150569

Update

8dd4452

[ghstack-poisoned]

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

8f24812

ghstack-source-id: 6f363f6 Pull Request resolved: pytorch#150569

kurtamohler mentioned this pull request Apr 4, 2025

Avoid overwriting COW data in MPS code #150721

Open

kurtamohler added a commit that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

fee5bd0

ghstack-source-id: 20c5f02 Pull Request resolved: #150569

Update

8ce0a44

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 4, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

544816f

ghstack-source-id: 83cfe9e Pull Request resolved: #150569

kurtamohler added a commit that referenced this pull request Apr 29, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

5718448

ghstack-source-id: 6cccde2 Pull Request resolved: #150569

Update

852b3e0

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 29, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

06e04ae

ghstack-source-id: fa61099 Pull Request resolved: #150569

Update

d4e3e2d

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 30, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

c08e75d

ghstack-source-id: 65a4c5f Pull Request resolved: #150569

Update

9167150

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 30, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

4615cc0

ghstack-source-id: ba67420 Pull Request resolved: #150569

Update

8c2e2a8

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 30, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

c41fbc5

ghstack-source-id: e508c76 Pull Request resolved: #150569

Update

bd20b89

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request Apr 30, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

f7b929f

ghstack-source-id: 6c1a950 Pull Request resolved: #150569

Update

be34de7

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request May 1, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

783d22f

ghstack-source-id: f382e2f Pull Request resolved: #150569

Update

0f438a7

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request May 1, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

1dfa452

ghstack-source-id: b2facee Pull Request resolved: #150569

Update

c254e37

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request May 1, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

38a59be

ghstack-source-id: a6c604a Pull Request resolved: #150569

Update

594d7c7

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request May 1, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

e98b3ce

ghstack-source-id: e5f0395 Pull Request resolved: #150569

Update

b9c9a95

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request May 1, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

3bff169

ghstack-source-id: 0dd5075 Pull Request resolved: #150569

Update

e088508

[ghstack-poisoned]

kurtamohler added a commit that referenced this pull request May 2, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

0449bc1

ghstack-source-id: 738d0b6 Pull Request resolved: #150569

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request May 13, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

689a7ce

ghstack-source-id: 0dd5075 Pull Request resolved: pytorch#150569

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request May 14, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

6b6e236

ghstack-source-id: 0dd5075 Pull Request resolved: pytorch#150569

kurtamohler added a commit to kurtamohler/pytorch that referenced this pull request May 16, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

59539f9

ghstack-source-id: 0dd5075 Pull Request resolved: pytorch#150569

Update

8fd51f9

[ghstack-poisoned]

kurtamohler 7A47 added a commit that referenced this pull request May 16, 2025

Enable lazy cloning in Tensor.to between CPU and MPS

e94ab35

ghstack-source-id: 79b47df Pull Request resolved: #150569

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable lazy cloning in `Tensor.to` between CPU and MPS #150569

Enable lazy cloning in `Tensor.to` between CPU and MPS #150569

Enable lazy cloning in Tensor.to between CPU and MPS #150569

Are you sure you want to change the base?

Enable lazy cloning in Tensor.to between CPU and MPS #150569

Conversation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150569

❌ 5 New Failures, 1 Unrelated Failure

Enable lazy cloning in `Tensor.to` between CPU and MPS #150569

Enable lazy cloning in `Tensor.to` between CPU and MPS #150569