[sparse][semi-structured] Fix RuntimeError when passing in non-contiguous input to SparseSemiStructured linear #114593

jcaip · 2023-11-27T14:00:47Z

Summary:

This PR also brings in changes from #105595, which are needed for the changes in #110420

Currently, PyTorch incorrectly calculates the size of the returned matrix when we pass a non-contiguous batched (>2d) input to the semi-structured sparse subclass.

This is most common in MLP layers, where we have 2 linear layers back to back.

This will lead to an error like the following:

RuntimeError: shape '[20, 64, 64, 3072]' is invalid for input of size
62914560

Where the size of the sparse matmul result is off because we infer the output shape with the wrong tensor shape.

This happens because of a bug where we did not update the subclass tensor shape when doing transpose.
For semi-structured sparsity, transposing is a no-op where we just set the boolean flag, but we forgot to also update the tensor shape.

Note that this error goes away in inference mode, since we avoid decomposing the aten.linear op and handle shape folding ourselves, which changes the execution path.

An alternative way to fix this issue is to set
TORCH_FLATTEN_LINEAR_3D=True, which will also fix this error.

Test Plan:

python test/test_sparse_semi_structured.py -k test_mlp

Reviewers:

Subscribers:

Tasks:

Tags:
Pull Request resolved: #110420 Approved by: https://github.com/alexsamardzic, https://github.com/cpuhrsch

Pull Request resolved: #105595 Approved by: https://github.com/jcaip

Summary: Currently, PyTorch incorrectly calculates the size of the returned matrix when we pass a non-contiguous batched (>2d) input to the semi-structured sparse subclass. This is most common in MLP layers, where we have 2 linear layers back to back. This will lead to an error like the following: ``` RuntimeError: shape '[20, 64, 64, 3072]' is invalid for input of size 62914560 ``` Where the size of the sparse matmul result is off because we infer the output shape with the wrong tensor shape. This happens because of a bug where we did not update the subclass tensor shape when doing transpose. For semi-structured sparsity, transposing is a no-op where we just set the boolean flag, but we forgot to also update the tensor shape. Note that this error goes away in inference mode, since we avoid decomposing the aten.linear op and handle shape folding ourselves, which changes the execution path. An alternative way to fix this issue is to set TORCH_FLATTEN_LINEAR_3D=True, which will also fix this error. Test Plan: ``` python test/test_sparse_semi_structured.py -k test_mlp ``` Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: #110420 Approved by: https://github.com/alexsamardzic, https://github.com/cpuhrsch

pytorch-bot · 2023-11-27T14:00:52Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/114593

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 4091997 with merge base 138e289 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2024-01-26T17:33:50Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

alexsamardzic and others added 2 commits November 27, 2023 05:49

Minor fixes in semi-structured sparse code (#105595)

24c1b2e

Pull Request resolved: #105595 Approved by: https://github.com/jcaip

pytorch-bot bot added the release notes: sparse release notes category label Nov 27, 2023

jcaip mentioned this pull request Nov 27, 2023

[v2.1.2] Release Tracker #113962

Closed

github-actions bot added the Stale label Jan 26, 2024

jcaip closed this Feb 12, 2024

github-actions bot deleted the jcaip/semi-structured-sparse-shape-mismatch-bugfix-2.1.2 branch March 14, 2024 01:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[sparse][semi-structured] Fix RuntimeError when passing in non-contiguous input to SparseSemiStructured linear #114593

[sparse][semi-structured] Fix RuntimeError when passing in non-contiguous input to SparseSemiStructured linear #114593

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[sparse][semi-structured] Fix RuntimeError when passing in non-contiguous input to SparseSemiStructured linear #114593

[sparse][semi-structured] Fix RuntimeError when passing in non-contiguous input to SparseSemiStructured linear #114593

Uh oh!

Conversation

Uh oh!

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/114593

✅ No Failures

Uh oh!

Uh oh!

Uh oh!