Simplify _compute_symbolic_stride() #138844

aorenste · 2024-10-24T19:10:19Z

Rewrite _compute_symbolic_stride() to make it simpler and faster.

The existing code involves several inner loops in an attempt to process the common case faster - but in reality this effort is actually slower than the simpler code.

Testing:
The initial version of this PR (which passed all tests) ran both the old algorithm and new algorithm and compared the results to make sure that results were substantially the same (they weren't the same simply because the algorithm allocates new dynamic symbols as part of it).

I also measured the timing of both methods and from the cases I checked the simpler algorithm was generally about 30% faster (which was usually the "fast path" of the old algorithm).

Stack from ghstack (oldest at bottom):

cc @ezyang @SherlockNoMad @EikanWang @jgong5 @wenzhe-nrv

[ghstack-poisoned]

pytorch-bot · 2024-10-24T19:10:23Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138844

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3bee778 with merge base 82ce888 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

ghstack-source-id: c27a525 Pull Request resolved: #138844

cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

ghstack-source-id: db29a07 Pull Request resolved: #138844

Rewrite _compute_symbolic_stride() to make it simpler and faster. The existing code involves several inner loops in an attempt to process the common case faster - but in reality this effort is actually slower than the simpler code. Testing: The initial version of this PR (which passed all tests) ran both the old algorithm and new algorithm and compared the results to make sure that results were substantially the same (they weren't the same simply because the algorithm allocates new dynamic symbols as part of it). I also measured the timing of both methods and from the cases I checked the simpler algorithm was generally about 30% faster (which was usually the "fast path" of the old algorithm). cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

ghstack-source-id: 9a3df31 Pull Request resolved: #138844

ezyang · 2024-12-13T14:54:07Z

I am deferring to Bob as he is also monkeying around with this code recently

aorenste · 2024-12-17T17:01:39Z

@pytorchbot merge

pytorchmergebot · 2024-12-17T17:03:26Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Rewrite _compute_symbolic_stride() to make it simpler and faster. The existing code involves several inner loops in an attempt to process the common case faster - but in reality this effort is actually slower than the simpler code. Testing: The initial version of this PR (which passed all tests) ran both the old algorithm and new algorithm and compared the results to make sure that results were substantially the same (they weren't the same simply because the algorithm allocates new dynamic symbols as part of it). I also measured the timing of both methods and from the cases I checked the simpler algorithm was generally about 30% faster (which was usually the "fast path" of the old algorithm). Pull Request resolved: pytorch#138844 Approved by: https://github.com/bobrenjc93 ghstack dependencies: pytorch#138843

WIP

e6e4a56

[ghstack-poisoned]

aorenste mentioned this pull request Oct 24, 2024

Move inner loop of _create_symbolic_sizes_strides_storage_offset into its own method #138843

Closed

pytorch-bot bot added ciflow/inductor release notes: fx release notes category labels Oct 24, 2024

facebook-github-bot added the fx label Oct 24, 2024

Update on "WIP"

bdbb200

cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

aorenste added a commit that referenced this pull request Oct 25, 2024

WIP

7a58657

ghstack-source-id: c27a525 Pull Request resolved: #138844

Update on "WIP"

d28ee14

cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

Update on "WIP"

d6a6f5d

cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

Update on "WIP"

01b22b7

cc ezyang SherlockNoMad EikanWang jgong5 wenzhe-nrv [ghstack-poisoned]

aorenste added a commit that referenced this pull request Oct 25, 2024

WIP

49f3060

ghstack-source-id: db29a07 Pull Request resolved: #138844

aorenste changed the title ~~WIP~~ Simplify _compute_symbolic_stride() Oct 25, 2024

aorenste requested review from ezyang and Chillee October 25, 2024 18:53

aorenste marked this pull request as ready for review October 25, 2024 18:53

aorenste added a commit that referenced this pull request Dec 13, 2024

WIP

0a5b018

ghstack-source-id: 9a3df31 Pull Request resolved: #138844

ezyang requested a review from bobrenjc93 December 13, 2024 14:53

bobrenjc93 approved these changes Dec 17, 2024

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 17, 2024

pytorchmergebot added the merging label Dec 17, 2024

pytorchmergebot added the Merged label Dec 17, 2024

pytorchmergebot closed this in e7704f4 Dec 17, 2024

pytorchmergebot removed the merging label Dec 17, 2024

github-actions bot deleted the gh/aorenste/140/head branch January 18, 2025 02:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Simplify _compute_symbolic_stride() #138844

Simplify _compute_symbolic_stride() #138844

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Simplify _compute_symbolic_stride() #138844

Simplify _compute_symbolic_stride() #138844

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138844

✅ No Failures

Uh oh!

Uh oh!

Uh oh!

Merge started

Uh oh!

Uh oh!