8000 Add noncontiguous OpInfo tests for MPS by skotapati · Pull Request #142202 · pytorch/pytorch · GitHub
[go: up one dir, main page]

Skip to content

Add noncontiguous OpInfo tests for MPS #142202

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed

Conversation

skotapati
Copy link
Collaborator
@skotapati skotapati commented Dec 6, 2024

PR to enable noncontiguous_sample OpInfo tests for MPS

This should help avoid regressions as seen in #141836

Will follow up with additional PRs to enable remaining OpInfo tests. It may not be feasible to review and merge the change if I try enabling them all at once, due to the size of the diff

@skotapati skotapati requested review from kulinseth and malfet December 6, 2024 01:42
Copy link
pytorch-bot bot commented Dec 6, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/142202

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit d9f9e22 with merge base 7eb51e5 (image):

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@skotapati
Copy link
Collaborator Author
skotapati commented Dec 6, 2024

Still a WIP, just putting this up so I can see live CI results Ready for review

Copy link
netlify bot commented Dec 6, 2024

Deploy Preview for chimerical-cranachan-793287 ready!

Name Link
🔨 Latest commit 93b5424
🔍 Latest deploy log https://app.netlify.com/sites/chimerical-cranachan-793287/deploys/6752de91492d900008b3c17f
😎 Deploy Preview https://deploy-preview-142202--chimerical-cranachan-793287.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@skotapati skotapati added the topic: not user facing topic category label Dec 6, 2024
@skotapati skotapati force-pushed the dev/skotapati/noncontiguous_testing branch from 3933c71 to a274133 Compare December 12, 2024 23:04
@skotapati skotapati force-pushed the dev/skotapati/noncontiguous_testing branch from 4428097 to cec0350 Compare January 6, 2025 17:55
@skotapati skotapati changed the title Expand OpInfo-based testing for MPS Add noncontiguous OpInfo tests for MPS Jan 10, 2025
@skotapati skotapati marked this pull request as ready for review January 10, 2025 06:13
@skotapati skotapati requested review from mruberry and a team as code owners January 10, 2025 06:13
@skotapati skotapati added the ciflow/mps Run MPS tests (subset of trunk) label Jan 10, 2025
@janeyx99 janeyx99 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jan 12, 2025
@skotapati
Copy link
Collaborator Author
skotapati commented Jan 13, 2025

@albanD Could I get an override for the LOC limit in the PR sanity checks? I'm not sure how to break down the OpInfo implementation any further than I already have. Thanks!

@skotapati skotapati force-pushed the dev/skotapati/noncontiguous_testing branch from 3d884b7 to e1daa64 Compare January 14, 2025 17:35
@seemethere
Copy link
Member

@albanD Could I get an override for the LOC limit in the PR sanity checks? I'm not sure how to break down the OpInfo implementation any further than I already have. Thanks!

Howdy! I'd probably say for any change that's this big we would probably say no to bypassing the LOC limit. Is there absolutely no way of breaking up this change?

Copy link
Member
@seemethere seemethere left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should find a way to do this that does not require a 25k+ LOC change

@skotapati
Copy link
Collaborator Author

@seemethere the change got this big because I kept having to reformat common_method_invocations.py to pass the lintrunner. Perhaps I had my linter configured wrong, but it seems like common_method_invocations.py was already out of compliance so I ended up having to reformat much of the document. It is mostly just indentation changes though

One alternative could be to exempt the OpInfo definitions from the linter, is that possible?

@skotapati skotapati requested a review from seemethere January 14, 2025 21:36
Copy link
Contributor
@malfet malfet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you mind breaking it down into several smaller PRs, and may be have some sort of a summary of how many ops are passing right now vs how many are failing.

From a reviewer side, it looks like thousands of xfail-if-mps are being added to common_methods_invocation, which IMO too verbose and isn't helpful.

  • If one needs to xfail all test_non_standard_bool it should be done via for op in ops: ops.skips.append(xfail-non-standard-bool)
  • If majority of non-contiguous ops fail for MPS, again one can codegen it with
MPS_NON_CONTIG_PASS_LIST={add, sum}
for op in op_db:
  if op.name not in MPS_NON_CONTIG_PASS_LIST:
     op.skip.append(xfail-if-non-contig-on-mps)

@skotapati skotapati force-pushed the dev/skotapati/noncontiguous_testing branch 2 times, most recently from 7096e93 to 94d1a88 Compare January 17, 2025 07:42
@skotapati skotapati added the keep-going Don't stop on first failure, keep running tests until the end label Jan 17, 2025
@skotapati
Copy link
Collaborator Author

@pytorchbot rebase

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased dev/skotapati/noncontiguous_testing onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout dev/skotapati/noncontiguous_testing && git pull --rebase)

@pytorchmergebot pytorchmergebot force-pushed the dev/skotapati/noncontiguous_testing branch from bbe2916 to d9f9e22 Compare January 28, 2025 22:03
@skotapati
Copy link
Collaborator Author

Do not merge, I'll be rebasing this on top of a separate PR where the mps op db will be implemented in a more coherent way

Copy link
Contributor

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/mps Run MPS tests (subset of trunk) open source Stale topic: not user facing topic category triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants
0