8000 [DRAFT][Reshape] Guard-free reshape for contiguous tensors to avoid data dependent errors. by laithsakka · Pull Request #148742 · pytorch/pytorch · GitHub
Status: Draft. laithsakka wants to merge 9 commits into base branch gh/laithsakka/114/base.
Conversation

@laithsakka (Contributor) commented on Mar 7, 2025

Stack from ghstack (oldest at bottom):

The main reason for this refactor is to avoid the data-dependent errors that the default path can hit; the new path performs no checks on sizes.
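
For illustration, a hedged sketch (not taken from this PR's tests; the function and shapes are made up) of the kind of pattern under torch.compile where the default reshape path can hit a data-dependent error on an unbacked size:

```python
import torch

torch._dynamo.config.capture_scalar_outputs = True

@torch.compile(fullgraph=True)
def f(x, n):
    u = n.item()  # u becomes an unbacked SymInt under compile
    # On the default reshape path, resolving the target sizes (the -1 and the
    # numel consistency checks) can require guards on u, which may raise a
    # data-dependent error; a guard-free path for contiguous inputs avoids this.
    return x.reshape(u, -1)

# f(torch.randn(12), torch.tensor(3))  # may raise a data-dependent guard error
```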

Does this deviate from previous behavior, or from torch eager, especially when it comes to strides?
I need to dig deeper into this; I do not have a clear answer. There was one situation where this used to diverge
from previous behavior, when we reshape the input to its own shape, but I addressed that.
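
As a hypothetical illustration of why reshape-to-the-same-shape is subtle (the example tensor is made up, not from the PR): a tensor can be reported as contiguous while its strides differ from the canonical contiguous strides, so recomputing strides from the shape alone would not round-trip them:

```python
import torch

# Size-1 dimensions may carry arbitrary strides and the tensor is still
# reported as contiguous, so recomputing strides purely from the shape
# would yield (3, 1) instead of the original (1, 1).
t = torch.empty(3).as_strided((1, 3), (1, 1))
print(t.is_contiguous())  # True
print(t.stride())         # (1, 1); canonical contiguous strides for (1, 3) are (3, 1)
```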

In general we have three choices here:

  1. Use the new logic only for unbacked sizes.
  2. Use it for all of compile.
  3. Use it for both eager and compile.

I do not want to spend time fixing the failing tests if we are going to go with (1), or if the idea is not accepted;
that is why I would like to have this discussed first. The failures do not seem risky (changed expected strings, runtime asserts, etc.).

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames

pytorch-bot commented on Mar 7, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148742

Note: Links to docs will display an error until the docs builds have been completed.

❌ 10 New Failures, 4 Unrelated Failures

As of commit 8bb976d with merge base d363913:

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

laithsakka added a commit that referenced this pull request Mar 7, 2025
ghstack-source-id: 4320acb
Pull Request resolved: #148742
github-actions bot commented on Mar 7, 2025

This PR needs a release notes: label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@laithsakka laithsakka changed the title guard reshape for contiguous tesnors guard-free reshape for contiguous tensors Mar 7, 2025
@laithsakka laithsakka changed the title guard-free reshape for contiguous tensors Guard-free reshape for contiguous tensors Mar 7, 2025
laithsakka added a commit that referenced this pull request Mar 7, 2025
ghstack-source-id: 26372e1
Pull Request resolved: #148742
@laithsakka laithsakka marked this pull request as draft March 7, 2025 06:05
laithsakka added a commit that referenced this pull request Mar 7, 2025
ghstack-source-id: a02a03d
Pull Request resolved: #148742
laithsakka added a commit that referenced this pull request Mar 7, 2025
ghstack-source-id: 3855846
Pull Request resolved: #148742
laithsakka added a commit that referenced this pull request Mar 7, 2025
ghstack-source-id: cbaabe1
Pull Request resolved: #148742
return prims.view_of(a)

strides = [1]
for x in reversed(shape[1:]):
A reviewer (Contributor) commented on the snippet above:
should this be shape[:-1]?

laithsakka (Author) replied:
Haha, maybe; I am new-ish to Python, no fancy slicing in C++.
I will try it.

laithsakka (Author) replied:
You mean, why not [1:][::-1]? I want to remove the first element, then reverse.
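
For context, a minimal standalone sketch (the helper name is made up) of what this loop computes, which also shows why iterating shape[1:] rather than shape[:-1] yields the canonical contiguous strides:

```python
def contiguous_strides(shape):
    # The last dimension of a contiguous tensor has stride 1, and each
    # earlier stride is the product of all later sizes; the first size
    # therefore never contributes to any stride, so shape[0] is skipped.
    strides = [1]
    for x in reversed(shape[1:]):
        strides.append(strides[-1] * x)
    return list(reversed(strides))

assert contiguous_strides([2, 3, 4]) == [12, 4, 1]
```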

laithsakka added a commit that referenced this pull request Mar 7, 2025
ghstack-source-id: d09c8a2
Pull Request resolved: #148742
@laithsakka laithsakka requested a review from ezyang March 7, 2025 19:58
@laithsakka laithsakka added the keep-going Don't stop on first failure, keep running tests until the end label Mar 8, 2025
laithsakka added a commit that referenced this pull request Mar 8, 2025
ghstack-source-id: 3e1561c
Pull Request resolved: #148742
@laithsakka laithsakka changed the title Guard-free reshape for contiguous tensors [DRAFT] Guard-free reshape for contiguous tensors to avoid data dependent errors. Mar 8, 2025
@laithsakka laithsakka changed the title [DRAFT] Guard-free reshape for contiguous tensors to avoid data dependent errors. [DRAFT][Reshape] Guard-free reshape for contiguous tensors to avoid data dependent errors. Mar 8, 2025
Divigroup-RAP pushed a commit to Divigroup-RAP/PYTORCH that referenced this pull request Apr 22, 2025
ghstack-source-id: e8baa07
Pull Request resolved: pytorch/pytorch#148742
github-actions bot commented:
Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions bot added the Stale label May 15, 2025
Labels: ciflow/inductor, keep-going (Don't stop on first failure, keep running tests until the end), module: dynamo, Stale
Projects: None yet
2 participants