Pass `torch.load(weights_only=)` internally to avoid FutureWarning by awaelchli · Pull Request #130663 · pytorch/pytorch
Closed
wants to merge 4 commits

Conversation

@awaelchli (Contributor) commented Jul 13, 2024

@pytorch-bot (bot) commented Jul 13, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/130663

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e0275f8 with merge base 9df4bc6:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot (bot) added the module: distributed_checkpoint and oncall: distributed (Add this issue/PR to distributed oncall triage queue) labels Jul 13, 2024
@bdhirsh requested a review from mikaylagawarecki Jul 13, 2024 01:51
@bdhirsh added the triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) label Jul 13, 2024
@ppwwyyxx (Collaborator)
There is also:

torch24/lib/python3.10/site-packages/torch/storage.py:414: FutureWarning: You are using `torch.load` with `weights_only=False` (the current default value), which uses the default pickle module implicitly. It is possible to construct malicious pickle data which will execute arbitrary code during unpickling (See https://github.com/pytorch/pytorch/blob/main/SECURITY.md#untrusted-models for more details). In a future release, the default value for `weights_only` will be flipped to `True`. This limits the functions that could be executed during unpickling. Arbitrary objects will no longer be allowed to be loaded via this mode unless they are explicitly allowlisted by the user via `torch.serialization.add_safe_globals`. We recommend you start setting `weights_only=True` for any use case where you don't have full control of the loaded file. Please open an issue on GitHub for any issues related to this experimental feature.
  return torch.load(io.BytesIO(b))
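
As a side note, a minimal sketch of the explicit opt-in the warning recommends, assuming the payload contains only tensors (the buffer and keys here are illustrative, not taken from this PR):

```python
import io

import torch

# Illustrative tensor-only payload saved to an in-memory buffer.
buf = io.BytesIO()
torch.save({"weights": torch.randn(2, 2)}, buf)
buf.seek(0)

# Passing weights_only=True explicitly silences the FutureWarning and uses
# the restricted unpickler, which only reconstructs tensors and other
# allowlisted types.
state = torch.load(buf, weights_only=True)
```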

@awaelchli (Contributor, Author)

@ppwwyyxx I believe you are referring to this line, which is already taken care of by a previous fix.

@malfet (Contributor) left a comment

Sure, but why not change it to True? Are you expecting to store something that is unsafe?

@mikaylagawarecki (Contributor) left a comment

For weights_only=True, if there is a known list of GLOBALs that might be encountered (e.g. for DTensor) and that we expect to be safe, we can use the with torch.serialization.safe_globals([bla]): context manager.

For DTensor, for example, [DTensor, DeviceMesh, Shard, DTensorSpec, TensorMeta] might be needed.
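
A rough sketch of that suggestion (the import paths and the exact set of classes to allowlist are assumptions and vary across PyTorch versions; checkpoint.pt is a placeholder):

```python
import torch
from torch.distributed._tensor import DTensor, Shard
from torch.distributed.device_mesh import DeviceMesh

# Temporarily allowlist the DTensor-related classes we expect in the file,
# then load with the restricted weights_only unpickler.
with torch.serialization.safe_globals([DTensor, DeviceMesh, Shard]):
    state = torch.load("checkpoint.pt", weights_only=True)
```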

@@ -108,7 +108,7 @@ def _broadcast_object(
     )
     dist.broadcast(data_recv_tensor, src=src_rank, group=group, async_op=False)
     buffer = io.BytesIO(data_recv_tensor.cpu().numpy())
-    obj = torch.load(buffer, map_location=device)
+    obj = torch.load(buffer, map_location=device, weights_only=False)
@awaelchli (Contributor, Author)

From the code above, we can see this is broadcasting an object pickled to bytes, so loading here means unpickling. The function doesn't make any assumption about obj, such as it being a tensor, so we need to set weights_only=False here.
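
For context, a standalone sketch of the pickle-via-tensor pattern the helper relies on (illustrative only, not the actual _broadcast_object code; the dist.broadcast step is omitted):

```python
import io

import torch

# Sender side: any picklable object is serialized to bytes and wrapped in a
# uint8 tensor so it can be shipped with dist.broadcast.
payload = {"step": 3, "note": "not just tensors"}
buf = io.BytesIO()
torch.save(payload, buf)
byte_tensor = torch.frombuffer(bytearray(buf.getvalue()), dtype=torch.uint8)

# Receiver side: bytes back into an object. Because the payload can be any
# picklable object, the restricted weights_only=True unpickler could reject
# it, so weights_only=False is passed explicitly.
obj = torch.load(io.BytesIO(byte_tensor.numpy().tobytes()), weights_only=False)
assert obj == payload
```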

@awaelchli (Contributor, Author)

@mikaylagawarecki @malfet The reason I set them to False is simply to ensure I don't break anything, since False was implicitly the default before. For the distributed checkpoint utilities, I don't know whether we can assume that the checkpoint contains only tensors.

@wz337 (Contributor) commented Jul 15, 2024

> @mikaylagawarecki @malfet The reason I set them to False is simply to ensure I don't break anything, since False was implicitly the default before. For the distributed checkpoint utilities, I don't know whether we can assume that the checkpoint contains only tensors.

Switching to weights_only=True would probably break things. One use case is TrainingState here: https://github.com/pytorch/pytorch/blob/main/test/distributed/checkpoint/e2e/test_e2e_save_and_load.py#L100
Also, we are not sure whether users have similar use cases, so if we make the switch now, users might experience disruption.
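
To make the breakage concrete, a hypothetical stand-in for such a use case (TrainingState here is a made-up dataclass, not the class from the linked test):

```python
import io
from dataclasses import dataclass

import torch


@dataclass
class TrainingState:  # stand-in for a user-defined object stored in a checkpoint
    step: int
    best_loss: float


buf = io.BytesIO()
torch.save({"state": TrainingState(step=10, best_loss=0.5)}, buf)

buf.seek(0)
obj = torch.load(buf, weights_only=False)  # works: full pickle semantics

buf.seek(0)
try:
    torch.load(buf, weights_only=True)  # rejected: TrainingState is not allowlisted
except Exception as err:
    print(f"weights_only=True refused the custom class: {err}")
```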

@LucasLLC (Contributor) left a comment

lgtm!

@awaelchli (Contributor, Author) commented Jul 15, 2024

Could someone approve the CI workflow again? Thanks. I didn't know my push would disable it again.

@awaelchli (Contributor, Author)

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 15, 2024
@pytorchmergebot (Collaborator)

Merge failed

Reason: This PR needs a release notes: label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Details for Dev Infra team: raised by workflow job.

@awaelchli (Contributor, Author)

@pytorchbot label "release notes: python_frontend"

@pytorch-bot pytorch-bot bot added the release notes: python_frontend python frontend release notes category label Jul 15, 2024
@awaelchli (Contributor, Author)

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@atalman (Contributor) commented Aug 15, 2024

@pytorchbot cherry-pick --onto release/2.4 -c critical --fixes #130658

pytorchbot pushed a commit that referenced this pull request Aug 15, 2024
@pytorchbot (Collaborator)

Cherry picking #130663

The cherry pick PR is at #133594 and it is linked with issue #130658. The following tracker issues are updated:

Details for Dev Infra team: raised by workflow job.

atalman pushed a commit that referenced this pull request Aug 20, 2024
Pass `torch.load(weights_only=)` internally to avoid FutureWarning (#133594)

Pass `torch.load(weights_only=)` internally to avoid FutureWarning (#130663)

Fixes #130658

Pull Request resolved: #130663
Approved by: https://github.com/malfet, https://github.com/LucasLLC

(cherry picked from commit ad314a2)

Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
Labels
ciflow/trunk (Trigger trunk jobs on your pull request), Merged, oncall: distributed (Add this issue/PR to distributed oncall triage queue), open source, release notes: python_frontend (python frontend release notes category), triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Development

Successfully merging this pull request may close these issues:

Internal uses of torch.load are missing weights_only and raise FutureWarning
10 participants