Data dependent free reshape. by laithsakka · Pull Request #153198 · pytorch/pytorch · GitHub
Data dependent free reshape. #153198


Open · wants to merge 15 commits into base: gh/laithsakka/172/base

Conversation

@laithsakka (Contributor) commented May 8, 2025

Stack from ghstack (oldest at bottom):

Change 1:

Let's consider the most general case: if torch.compile is asked to reshape [u0, u1][u3, u4] -> [u5, u6], what should it do?
This shape is general enough to cover both contiguous and non-contiguous tensors, i.e. tensors where a clone-free reshape is possible and others where it is not. The current algorithm fails with data-dependent errors.

The general idea is: if it is impossible to tell whether the reshape can happen in place (because for some concrete inputs it can and for others it cannot), then it is fine to take the general path and clone, instead of failing or asking the user for hints.

With this change, reshape works as follows (a rough sketch is given after the side note below):

  1. If we know the input is contiguous, we convert the reshape into a view.
  2. If compute_strides succeeds, we use a view. (compute_strides was changed so that it no longer fails when unbacked symbols are present; instead it returns nullptr when it cannot compute the strides, meaning we should clone.)
  3. If neither 1 nor 2 works, we clone and then view.

Side note: producing a view does not mean that Inductor will not clone; Inductor has a pass that converts all views back to reshapes.
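
For illustration only, here is a minimal eager-mode Python sketch of the three-step decision above. It is not the PR's implementation: compute_strides_or_none is a deliberately simplified stand-in for at::detail::computeStride, and only the control flow mirrors the description.

```python
import torch


def compute_strides_or_none(sizes, strides, new_sizes):
    # Simplified stand-in for at::detail::computeStride: it only recognizes the
    # already-contiguous case and returns None otherwise, signalling that a
    # clone-free view cannot be proven possible. The real helper also handles
    # many non-contiguous layouts.
    if list(torch.empty(sizes).stride()) == list(strides):
        return list(torch.empty(new_sizes).stride())
    return None


def reshape_sketch(t: torch.Tensor, new_sizes):
    # 1. Known contiguous: the reshape becomes a view.
    if t.is_contiguous():
        return t.view(new_sizes)
    # 2. Strides computable without copying: still a view.
    if compute_strides_or_none(list(t.shape), list(t.stride()), new_sizes) is not None:
        return t.view(new_sizes)
    # 3. Otherwise take the general path: clone (via contiguous) and then view.
    return t.contiguous().view(new_sizes)


x = torch.arange(12).reshape(3, 4).t()   # non-contiguous input
print(reshape_sketch(x, (2, 6)).shape)   # falls into the clone path
```

In the unbacked case the point is simply that step 3 is always a safe answer when steps 1 and 2 cannot be decided symbolically.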

Change 2:

We trace _reshape_view_helper when doing fake tensor tracing, but not during proxy tracing; hence this tracing does not affect the graph (it only computes output shapes for several operations). We should not fail there, because for a reshape it should always be possible to get through:
by the time reshape_symint was called we would have either cloned or compute_strides would have succeeded, so the view should pass.

What I did is the following: we run _reshape_view_helper, and if it fails due to unbacked symbols we call _view_simple, which always succeeds for reshapes (it might fail for views when the view is impossible; in that case we re-throw the DDE that was thrown by the original algorithm). A sketch follows.
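
A minimal sketch of that try/except path, assuming the PR's helpers: _reshape_view_helper already lives in torch/_refs/__init__.py, while _view_simple is the helper added by this PR, so the exact call used here is an assumption.

```python
from torch._refs import _reshape_view_helper
from torch.fx.experimental.symbolic_shapes import GuardOnDataDependentSymNode


def _reshape_view_meta_sketch(a, shape, allow_copy):
    try:
        # Existing algorithm; on unbacked symints it can raise a
        # data-dependent (DDE) guard error during fake tensor tracing.
        return _reshape_view_helper(a, *shape, allow_copy=allow_copy)
    except GuardOnDataDependentSymNode as dde:
        try:
            # _view_simple is the helper added by this PR (signature assumed
            # here). It always succeeds for reshapes, since reshape_symint has
            # already either cloned or computed valid strides by this point.
            return _view_simple(a, shape)  # noqa: F821
        except Exception:
            # A plain view that is genuinely impossible: surface the
            # data-dependent error raised by the original algorithm.
            raise dde
```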

Ideally I would register _view_simple as the meta and avoid calling _reshape_view_helper entirely, but I am running into issues with the dispatcher and subclasses and do not have time to debug them; namely, one test ends up calling view instead of view_symint during meta dispatch when I register a meta decomposition:
`python test/dynamo/test_subclasses.py SubclassTests.test_subclass_views_dynamic_True`
#153303 will follow up with that change in a separate PR. cc @bdhirsh

Two other alternatives to registering _view_simple as the meta, and to the try/catch approach in this PR, are:

  1. Call _view_simple if any input is dynamic; see [DRAFT] avoidance strategy for reshape_view_helper guards in compile: call _view_simple if inputs has dynamic dimensions. #153521
  2. If we make is_compiling work for framework code tracing (it does not right now), we can call _view_simple if is_compiling.

Note:

Reshape can still fail when is_contiguous is called; the next PR will handle that by calling is_known_contiguous.

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

pytorch-bot bot commented May 8, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153198

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added ciflow/inductor module: dynamo module: inductor oncall: distributed Add this issue/PR to distributed oncall triage queue release notes: distributed (fsdp) release notes category labels May 8, 2025
laithsakka added a commit that referenced this pull request May 8, 2025
ghstack-source-id: c64eb2d
Pull Request resolved: #153198
@laithsakka laithsakka changed the title from new_reshape to Data dependent free reshape. on May 8, 2025
laithsakka added a commit that referenced this pull request May 8, 2025
ghstack-source-id: f9a865c
Pull Request resolved: #153198
@laithsakka laithsakka requested a review from bobrenjc93 as a code owner May 10, 2025 02:34
laithsakka added a commit that referenced this pull request May 10, 2025
ghstack-source-id: 9c36c39
Pull Request resolved: #153198

@laithsakka laithsakka changed the title from Data dependent free reshape. to [draft] Data dependent free reshape. on May 10, 2025
laithsakka added a commit that referenced this pull request May 10, 2025
ghstack-source-id: 57a4858
Pull Request resolved: #153198

laithsakka added a commit that referenced this pull request May 12, 2025
ghstack-source-id: 599f371
Pull Request resolved: #153198

laithsakka added a commit that referenced this pull request May 12, 2025
ghstack-source-id: 7fd5afc
Pull Request resolved: #153198

@laithsakka laithsakka changed the title from [draft] Data dependent free reshape. to Data dependent free reshape. on May 12, 2025
laithsakka added a commit that referenced this pull request May 13, 2025
ghstack-source-id: 1ba1ef0
Pull Request resolved: #153198

@laithsakka laithsakka marked this pull request as draft May 13, 2025 18:52
laithsakka added a commit that referenced this pull request May 13, 2025
ghstack-source-id: 821c2ec
Pull Request resolved: #153198

@laithsakka (Contributor Author) commented:

I will split the meta registration into its own function, since it is hitting issues in the dispatcher for some tests: for subclasses we end up calling view instead of view_symint and crash. I do not have time to debug that, so I will split the change into a later diff to unblock and revert back to the try/catch approach here.
This is not ideal, since the reshape could still add guards that are not needed and cause undesired recompilations, but we will move step by step.

laithsakka added a commit that referenced this pull request May 13, 2025
ghstack-source-id: fbd66fc
Pull Request resolved: #153198



def _reshape_view_helper_core_alg(
a: TensorLikeType, shape, allow_copy: bool
@laithsakka (Contributor Author):

moved as is

laithsakka added a commit that referenced this pull request May 13, 2025
ghstack-source-id: 82ee9a8
Pull Request resolved: #153198

laithsakka added a commit that referenced this pull request May 13, 2025
ghstack-source-id: 5686bfe
Pull Request resolved: #153198

laithsakka added a commit that referenced this pull request May 13, 2025
ghstack-source-id: 6cdac7a
Pull Request resolved: #153198

@laithsakka laithsakka marked this pull request as ready for review May 13, 2025 23:18
# Handles general case: a 1+D tensor reshaped into a distinct 1+D shape
return _reshape_view_helper_core_alg(a, shape, allow_copy)
except GuardOnDataDependentSymNode as e:
# dynamic shapes do not show up in eager. For compile this function is on
@laithsakka (Contributor Author):

For compile this function is what?? Fix this comment.

auto stride = at::detail::computeStride(self.sym_sizes(), self.sym_strides(), inferred_size);
TORCH_INTERNAL_ASSERT(stride.has_value());
if (! stride.has_value()){
Contributor:

is the space intended?

@laithsakka (Contributor Author):

No, I can address this before landing; I just ran the linter.

Labels
ciflow/inductor module: dynamo module: inductor oncall: distributed Add this issue/PR to distributed oncall triage queue release notes: distributed (fsdp) release notes category
3 participants