NumPy support in torch.compile by lezcano · Pull Request #106211 · pytorch/pytorch · GitHub

Conversation

@lezcano
Collaborator
@lezcano lezcano commented Jul 28, 2023

RFC: pytorch/rfcs#54
First commit is the contents of https://github.com/Quansight-Labs/numpy_pytorch_interop/

We have already been using this in core for the last few months as an external dependency. This PR pulls all of it into core.

In the next commits, I do a number of things in this order

  • Fix a few small issues
  • Make the tests that this PR adds pass
  • Bend backwards until lintrunner passes
  • Remove the optional dependency on torch_np and simply rely on the upstreamed code
  • Fix a number of dynamo tests that were passing before (they were not testing anything, I think) and are not passing now.

Missing from this PR (but not blocking):

All the tests in tests/torch_np take about 75s to run.

This was joint work by @ev-br, @rgommers, @honno, and me. I did not create this PR via ghstack (which would have been convenient) because this is a collaboration, and ghstack doesn't allow for shared contributions.
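For context, the user-facing feature this PR works toward can be sketched as follows: a function written entirely against the NumPy API gets traced by dynamo and compiled. This is a minimal illustrative example, assuming a PyTorch build that includes this work and has NumPy installed; the function itself is made up.

```python
# Minimal sketch of NumPy support in torch.compile: plain NumPy code is
# traced and compiled, and ndarray inputs come back as ndarray outputs.
import numpy as np
import torch

def numpy_fn(x: np.ndarray, y: np.ndarray) -> np.ndarray:
    return np.sum(x * y, axis=0)

compiled_fn = torch.compile(numpy_fn)

x = np.arange(6.0).reshape(2, 3)
out = compiled_fn(x, x)  # a plain np.ndarray: array([ 9., 17., 29.])
```

Note that no explicit torch tensors appear in user code; the compat layer maps NumPy operations onto torch ops under the hood.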

cc @mruberry @rgommers @albanD @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @anijain2305 @ev-br

@lezcano lezcano added the module: numpy, module: dynamo, and release notes: dynamo labels Jul 28, 2023
@pytorch-bot
pytorch-bot bot commented Jul 28, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/106211

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 4 Unrelated Failures

As of commit 237489d:

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following job failed but was already present on the merge base 22bc08d:

👉 Rebase onto the `viable/strict` branch to avoid these failures

UNSTABLE - The following jobs failed, but likely due to flakiness present on trunk, and have been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@linux-foundation-easycla
linux-foundation-easycla bot commented Jul 28, 2023

CLA Signed

The committers listed above are authorized under a signed CLA.

@lezcano
Collaborator Author
lezcano commented Jul 28, 2023

@honno @rgommers @ev-br, could you please sign the CLA? Not sure why @ev-br does not show as a coauthor; I'll look into that. (nvm, he's there.)

@lezcano
Collaborator Author
lezcano commented Jul 28, 2023

For the reviewers: could you have a look at the commits you feel comfortable reviewing? All the large ones are either automated changes or attempts to make lintrunner pass.

@lezcano
Collaborator Author
lezcano commented Jul 28, 2023

Also, someone should comment on how correct it is, legally speaking, to upstream the NumPy tests. @rgommers @ezyang @malfet

@larryliu0820
Contributor

I think all authors need to sign the CLA

@larryliu0820
Contributor

Does this mean numpy becomes a dependency of pytorch?

@rgommers
Collaborator

Also, someone should comment on how correct it is, legally speaking, to upstream the NumPy tests. @rgommers @ezyang @malfet

This should be fine; we've included code from many other compatibly-licensed projects in the code base before. I suggest adding an entry in third_party/LICENSES_BUNDLED.txt pointing at the relevant directory. If it's within (part of) a single file, then also add a comment (e.g., see the top of test/test_typing.py, which already says "based on NumPy ...").

Does this mean numpy becomes a dependency of pytorch?

No, no dependency is added in this PR, the changes are self-contained. numpy is already an optional runtime dependency, and required at build time (see pyproject.toml).
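As a side note, the optional-runtime-dependency pattern looks roughly like this. This is a dependency-free sketch, not PyTorch's actual code; the `HAS_NUMPY` flag and helper name are made up for illustration.

```python
# Sketch of treating numpy as an optional runtime dependency: import it if
# present, and branch on a flag everywhere numpy-specific behavior is needed.
try:
    import numpy as np
    HAS_NUMPY = True
except ImportError:  # numpy stays optional at runtime
    np = None
    HAS_NUMPY = False

def as_plain_list(x):
    """Convert x to a plain Python list, using NumPy only when available."""
    if HAS_NUMPY and isinstance(x, np.ndarray):
        return x.tolist()
    return list(x)
```

Code written this way degrades gracefully when numpy is absent, which is what keeps the PR from adding a hard dependency.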

@janeyx99
Contributor

I notice there are lots of places where these tests rely on pytest and pytest tooling.

If these tests are to be upstreamed, their test-infra dependencies will have to be reconciled with our current test infra. We have our own nuanced equivalent of parametrize() (see class parametrize(_TestParametrizer) in our test utilities), as well as xfails and our general OpInfos. In general, we don't want to introduce a pytest dependency, and we have special handling in our test infra, so bringing consistency there will be important.

For example, I have not gotten a chance to go through the numpy op testing, but we probably want to:

  1. Reuse the OpInfo infra if there is sufficient 1:1 correspondence between torch ops and np ops, e.g., introduce a flag in OpInfo to do additional np checks when the flag is set.
  2. Write out a NumpyOpInfo if numpy ops do not match 1:1 with torch ops, and plug into a common way to parametrize over dtypes/certain levels of support.
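For readers unfamiliar with the infra being discussed, here is a dependency-free sketch of what a parametrize()-style helper does: it expands one marked test method into several concrete methods, one per parameter value. This is illustrative only, not PyTorch's actual implementation.

```python
# Toy parametrize(): tag a method with a parameter name and its values.
def parametrize(name, values):
    def deco(fn):
        fn._params = (name, values)
        return fn
    return deco

# Toy instantiator: replace each tagged method with one method per value.
def instantiate_parametrized_tests(cls):
    for attr, fn in list(vars(cls).items()):
        params = getattr(fn, "_params", None)
        if params is None:
            continue
        name, values = params
        delattr(cls, attr)
        for v in values:
            def make(fn=fn, name=name, v=v):
                def test(self):
                    return fn(self, **{name: v})
                return test
            setattr(cls, f"{attr}_{v}", make())
    return cls

class MyTests:
    @parametrize("n", [1, 2, 3])
    def test_square(self, n):
        return n * n

instantiate_parametrized_tests(MyTests)
t = MyTests()
results = [t.test_square_1(), t.test_square_2(), t.test_square_3()]
```

Both pytest's `pytest.mark.parametrize` and PyTorch's internal decorator follow this expand-at-collection-time shape; the reconciliation question above is about which of the two collection mechanisms the upstreamed tests should use.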

@lezcano
Collaborator Author
lezcano commented Jul 29, 2023

@janeyx99 these tests are copied, with minor edits, from NumPy's test suite. I don't think it'd be realistic to port them to our own suite. In fact, we tried very hard to make them pass with as few modifications as possible, so as to be sure that the compat layer we implemented replicated NumPy faithfully.

That being said, as I mentioned in the OP, I have yet to remove imports of internal NumPy objects, so as not to create dependencies on non-public NumPy APIs.

If people really feel strongly about it, I could try to make the tests not depend on NumPy at all, but that would require significantly more effort, of course.

As to the pytest dependency: pytest must already be at least an optional dependency, as there are many files like test_typing.py that use it, so this would be no different. That said, if people feel strongly about it, we could port these tests to use our internal tools.

@lezcano
Collaborator Author
lezcano commented Jul 29, 2023

Also, at the beginning of this project we discussed having our own OpInfos for this, and we discarded the idea. Getting to where the current OpInfos are now took many engineers many months of work, which is much more budget than we had for this project, so even if it were cleaner, I don't think the cost/benefit would be in our favor.

@lezcano lezcano force-pushed the torch_np branch 2 times, most recently from 0c7925f to 9cb447c on July 29, 2023 01:58
@ezyang
Contributor
ezyang commented Jul 29, 2023

Supporting @lezcano here; I also agree that it would be a lot of makework to "port" NumPy's test suite to PyTorch style. In fact, I have the opposite question: in an ideal world, we would occasionally take updates from NumPy's test suite as NumPy evolves and we evolve with it. Is the test suite identical enough that this would be possible? (Removing the dependencies on NumPy internals would make this harder! But maybe it is more important to be broadly compatible with many versions of NumPy than to be able to copy-paste in the NumPy suite?)

@lezcano
Collaborator Author
lezcano commented Jul 29, 2023

We discussed automating the process of bringing tests over from the NumPy test suite, but it's not particularly easy.

To put together this testing suite, we went through a fair amount of curation of these tests: some were skipped because they used features we were never going to support (endianness, arrays of strings...); those we would potentially like to support eventually were marked xfail; and some we edited, removing the parts we were not going to support while keeping the rest (e.g., in tests that iterate over all the dtypes, we would remove long double and int16 and so on, while keeping the others).

I think it'd be rather difficult to separate the signal from the noise if this process were automated.
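As a concrete illustration of that curation step, a trimmed dtype parametrization might look like this. The dtype lists and names here are illustrative, not the actual ones used in the PR.

```python
# An upstream NumPy test might iterate over many dtypes; the curated version
# drops the ones the compat layer will not support (e.g. longdouble, int16)
# while keeping the rest of the test intact.
import numpy as np

ALL_DTYPES = [np.float32, np.float64, np.longdouble, np.int16, np.int64]
SUPPORTED_DTYPES = [dt for dt in ALL_DTYPES
                    if dt not in (np.longdouble, np.int16)]

def check_zeros_dtype(dtype):
    """Body of the original test, unchanged by the curation."""
    x = np.zeros(3, dtype=dtype)
    assert x.dtype == np.dtype(dtype)

for dt in SUPPORTED_DTYPES:
    check_zeros_dtype(dt)
```

The hard-to-automate part is exactly this per-test judgment call about which parameters to drop, xfail, or keep.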

@ezyang
Contributor
ezyang commented Jul 29, 2023

Disappointing but understandable

@lezcano
Collaborator Author
lezcano commented Jul 29, 2023

cc @williamwen42 for all the dynamo errors in 3.11. I will try to file self-contained issues next week.

@janeyx99
Contributor

@lezcano Yeah, I don't feel strongly at all about having the test suite strictly follow PyTorch test infra (vs. sticking to numpy's), and what you and Ed said about expanding the scope of our tests makes sense. My observation is more that this change would introduce lots of tests that differ from what PyTorch is used to. If our current test infra (people on the Dev Infra team would know more) already automatically supports these tests with our features (like the disable bot, test reporting, ...), then that's great! If not, then certain things may have to change in PyTorch test infra, or there may be constraints worth bringing to light sooner rather than later.

@lezcano
Collaborator Author
lezcano commented Aug 8, 2023

oopsie daisy

@facebook-github-bot
Contributor

@ezyang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

atalman added a commit to atalman/pytorch that referenced this pull request Aug 11, 2023
@atalman
Contributor
atalman commented Aug 15, 2023

This is failing windows smoke tests: https://github.com/pytorch/pytorch/actions/runs/5830091528/job/15823773511#step:13:471
As per discussion, we don't want to keep a hard dependency on numpy for now.
