Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors #129645

pearu · 2024-06-27T09:52:36Z

As in the title:

To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported

sparse_xyz_tensor(indices, values, pin_memory=True)
sparse_xyz_tensor(indices, values).pin_memory()
sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())

Fixes #115330

Stack from ghstack (oldest at bottom):

-> Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors #129645

cc @alexsamardzic @nikitaved @cpuhrsch @amjames @bhosmer @jcaip

[ghstack-poisoned]

pytorch-bot · 2024-06-27T09:52:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/129645

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 58348cb with merge base bdd83c4 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…nsors" As in the title. Fixes #115330 [ghstack-poisoned]

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 [ghstack-poisoned]

ghstack-source-id: 5eb30b2 Pull Request resolved: #129645

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 [ghstack-poisoned]

ghstack-source-id: b324125 Pull Request resolved: #129645

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 [ghstack-poisoned]

amjames

I see one major issue that should be addressed.

What should happen when one component tensor passed to _with_tensorsconstructors, but not all. In particular consider when thevalues` are pinned, but indices tensor(s) are not.

a = torch.randn(10, 10)
a[::2, ::2] = 0
sp1 = a.to_sparse_coo()
values = a._values().clone().pin_memory()
indices = a._indices()
sp2 = torch.sparse_coo_tensor(indices, values, size=(10, 10))

Here sp2.is_pinned() would return true, but sp2._indices().is_pinned() is false. This is inconsistent with

sp3 = sp1.pin_memory()

which also has sp3.is_pinned() True, but sp3._indices().is_pinned() is True. Inconsistency like this will be a pain point for some users.

I see two options:

Expect this meta data field to match across members -> We should add this to the invariant checks, or enforce this by pinning out-of-sync members if any member is pinned in the constructors.
Allow for these to be out-of-sync
- sparse.is_pinned() should be a short hand for values.is_pinned() && indices.is_pinned() (and the plain/compressed indices for sparse compressed layouts) the same way that sparse.pin_memory() is short hand for pinning all member tensors.

Personally I think option 1) is better for the simplicity and I can't see a reason why one would want some members to be pinned while others are not.

aten/src/ATen/native/sparse/SparseCsrTensor.cpp

aten/src/ATen/native/sparse/SparseTensor.cpp

pearu · 2024-06-29T06:50:23Z

I see two options: ...

Registering a CPU tensor with CUDA is essentially a copy operation to another device (although the pinned tensor can still be accessed by CPU processes). In the case of sparse tensors, we don't allow indices and values to be on different devices: an error is raised when trying to create a sparse tensor from indices and values that have different devices.

I suggest that we treat pinned and non-pinned tensors as having different devices. So, when one tries to create a sparse tensor from indices and values that have different pinning state then an exception will be raised. This corresponds to the first part of option 1).

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 cc alexsamardzic nikitaved cpuhrsch amjames bhosmer jcaip [ghstack-poisoned]

ghstack-source-id: 872565d Pull Request resolved: #129645

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 cc alexsamardzic nikitaved cpuhrsch amjames bhosmer jcaip [ghstack-poisoned]

ghstack-source-id: d9e9736 Pull Request resolved: #129645

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 cc alexsamardzic nikitaved cpuhrsch amjames bhosmer jcaip [ghstack-poisoned]

ghstack-source-id: 32090b8 Pull Request resolved: #129645

…nsors" As in the title: To register indices/values of a sparse XYZ tensor with CUDA, the following methods are supported - `sparse_xyz_tensor(indices, values, pin_memory=True)` - `sparse_xyz_tensor(indices, values).pin_memory()` - `sparse_xyz_tensor(indices.pin_memory(), values.pin_memory())` Fixes #115330 cc alexsamardzic nikitaved cpuhrsch amjames bhosmer jcaip [ghstack-poisoned]

ghstack-source-id: 3783168 Pull Request resolved: #129645

pearu · 2024-08-02T05:56:52Z

@pytorchbot merge

pytorchmergebot · 2024-08-02T05:58:40Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

ece4300

[ghstack-poisoned]

pearu requested review from a team, albanD, eqy and soulitzer as code owners June 27, 2024 09:52

pytorch-bot bot added the release notes: sparse release notes category label Jun 27, 2024

pearu added open source topic: new features topic category labels Jun 27, 2024

pearu marked this pull request as draft June 27, 2024 10:41

Update on "Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC te…

7e31062

…nsors" As in the title. Fixes #115330 [ghstack-poisoned]

pearu added a commit that referenced this pull request Jun 27, 2024

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

30491ab

ghstack-source-id: 5eb30b2 Pull Request resolved: #129645

pearu added the keep-going Don't stop on first failure, keep running tests until the end label Jun 28, 2024

pearu added a commit that referenced this pull request Jun 28, 2024

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

5de918f

ghstack-source-id: b324125 Pull Request resolved: #129645

pearu marked this pull request as ready for review June 28, 2024 14:40

pearu requested a review from amjames June 28, 2024 14:40

amjames requested changes Jun 28, 2024

View reviewed changes

aten/src/ATen/native/sparse/SparseCsrTensor.cpp Show resolved Hide resolved

aten/src/ATen/native/sparse/SparseTensor.cpp Show resolved Hide resolved

amjames added the module: sparse Related to torch.sparse label Jun 28, 2024

pearu added a commit that referenced this pull request Jun 29, 2024

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

8689ccb

ghstack-source-id: 872565d Pull Request resolved: #129645

pearu requested a review from amjames June 29, 2024 07:53

amjames approved these changes Jul 1, 2024

View reviewed changes

cpuhrsch approved these changes Jul 30, 2024

View reviewed changes

eqy approved these changes Jul 30, 2024

View reviewed changes

pearu added a commit that referenced this pull request Aug 1, 2024

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

d213606

ghstack-source-id: d9e9736 Pull Request resolved: #129645

pearu added a commit that referenced this pull request Aug 1, 2024

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

1b934b9

ghstack-source-id: 32090b8 Pull Request resolved: #129645

pearu added a commit that referenced this pull request Aug 1, 2024

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors

081fcd3

ghstack-source-id: 3783168 Pull Request resolved: #129645

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Aug 2, 2024

pytorchmergebot added the merging label Aug 2, 2024

pytorchmergebot added the Merged label Aug 2, 2024

pytorchmergebot closed this in a4ea776 Aug 2, 2024

pytorchmergebot removed the merging label Aug 2, 2024

github-actions bot deleted the gh/pearu/130/head branch September 2, 2024 02:02

michael-diggin mentioned this pull request May 9, 2025

Loading sparse tensors in a DataLoader raises CUDA initialization error since 2.5.0 if you have already initialized CUDA #153143

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors #129645

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors #129645

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors #129645

Add pinned memory support to sparse COO/CSR/CSC/BSR/BSC tensors #129645

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/129645

✅ No Failures

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants