Fix memory leak on masked Tensor #137890

albanD · 2024-10-14T13:51:46Z

Note that this reverts the change from #137815 as well which is not needed anymore!

Without this, you create an unbeakable reference cycle. It is unbreakable because part of the cycle is through the autograd graph which we cannot traverse.

pytorch-bot · 2024-10-14T13:51:50Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137890

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 41881d7 with merge base 0e4d426 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

atalman

lgtm

Skylion007 · 2024-10-14T15:01:49Z

Do our current tests catch that the unbreakable reference cycle is broken?

albanD · 2024-10-14T17:50:37Z

It's pretty hard to detect that on the cpu side without flakyness :/
But the GPU tests are pretty good at it when you leak a Tensor. That's why the original PR linked above did the change to begin with.

albanD · 2024-10-14T17:50:44Z

@pytorchbot merge

pytorchmergebot · 2024-10-14T17:53:47Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-10-14T17:53:55Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 4, 5, lf.linux.g5.4xlarge.nvidia.gpu)

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

cpuhrsch · 2024-10-15T17:13:34Z

@pytorchbot merge

pytorchmergebot · 2024-10-15T17:15:25Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Note that this reverts the change from #137815 as well which is not needed anymore! Without this, you create an unbeakable reference cycle. It is unbreakable because part of the cycle is through the autograd graph which we cannot traverse. Pull Request resolved: #137890 Approved by: https://github.com/atalman, https://github.com/huydhn, https://github.com/Skylion007

kit1980 · 2024-10-23T20:01:39Z

2.5.1 is an emergency patch release to address specific large regressions, moving this to 2.6.0

kit1980 · 2025-01-25T01:47:28Z

For release 2.6 I verified that the change is present in https://github.com/pytorch/pytorch/blob/v2.6.0-rc9/torch/masked/maskedtensor/core.py and thus tested in CI.

Fix memory leak on masked Tensor

41881d7

albanD requested review from huydhn and cpuhrsch October 14, 2024 13:51

atalman approved these changes Oct 14, 2024

View reviewed changes

huydhn approved these changes Oct 14, 2024

View reviewed changes

huydhn added the topic: not user facing topic category label Oct 14, 2024

Skylion007 approved these changes Oct 14, 2024

View reviewed changes

nowtryz mentioned this pull request Oct 14, 2024

Fix masked tensor test_stack memory leak #137815

Closed

Skylion007 added this to the 2.5.1 milestone Oct 14, 2024

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 14, 2024

pytorchmergebot added the merging label Oct 14, 2024

pytorchmergebot removed the merging label Oct 14, 2024

cpuhrsch added release notes: sparse release notes category topic: bug fixes topic category and removed topic: not user facing topic category labels Oct 14, 2024

pytorchmergebot added the merging label Oct 15, 2024

pytorchmergebot added the Merged label Oct 15, 2024

pytorchmergebot closed this in bf77f52 Oct 15, 2024

pytorchmergebot removed the merging label Oct 15, 2024

kit1980 modified the milestones: 2.5.1, 2.6.0 Oct 23, 2024

atalman mentioned this pull request Jan 13, 2025

Release 2.6.0 validations checklist and cherry-picks #144503

Closed

73 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix memory leak on masked Tensor #137890

Fix memory leak on masked Tensor #137890

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Fix memory leak on masked Tensor #137890

Fix memory leak on masked Tensor #137890

Uh oh!

Conversation

Uh oh!

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137890

✅ No Failures

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Merge started

Uh oh!

Merge failed

Uh oh!

Uh oh!

Merge started

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!