Decomp for aten.dropout #106274
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/106274
Note: Links to docs will display an error until the docs builds have been completed.
✅ 2 Unrelated Failures as of commit 1bbb40a with merge base de8a91f:
BROKEN TRUNK - The following job failed but was present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from 5cfd131 to ac13a72
Force-pushed from ac13a72 to edef9d8
I'm down, if you can get it to pass tests...
Force-pushed from edef9d8 to 7401fcc
Force-pushed from 7401fcc to 42f0f77
Force-pushed from 42f0f77 to ec9e960
torch/_decomp/decompositions.py (Outdated)

```python
if train and p != 0:
    return aten.native_dropout(input, p, train)[0]
else:
    return input
```
I hope this is what the real op does (return the input directly), because this can cause some gnarly bugs if it isn't (especially because you are registering a regular decomp for this op)
@bdhirsh I remember you mentioned that native_dropout returns a clone in both train and eval branches. Is this OK?
Both native_dropout_cpu and native_dropout_cuda return an
output = input.clone();
IIUC, this is different from returning the input directly: it is a different instance of the tensor with different storage.
I need to update this to return input.clone().
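A minimal sketch of what the updated decomposition could look like after this change (illustrative only; the function name and registration details are assumed, and the actual decomp lives in torch/_decomp/decompositions.py):

```python
import torch
from torch import Tensor

aten = torch.ops.aten

# Illustrative version of the dropout decomp discussed above: clone in the
# eval / p == 0 branch instead of returning `input` directly, matching what
# native_dropout_cpu / native_dropout_cuda do.
def dropout_decomp(input: Tensor, p: float, train: bool) -> Tensor:
    if train and p != 0:
        return aten.native_dropout(input, p, train)[0]
    else:
        return input.clone()
```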
Force-pushed from 1f9bf22 to 442d74c
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Merge failed. Reason: 1 mandatory check(s) failed. Dig deeper by viewing the failures on hud.
Force-pushed from 442d74c to 1bbb40a
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…108141) A previous PR #106274 decomposes `aten.dropout` and creates a `clone()` in `eval()` mode or when `p=0`. This makes many SDPA-related models fail to match the fused_attention pattern matchers. This PR adds new fused_attention pattern matchers with an additional clone to re-enable SDPA op matching.
Pull Request resolved: #108141
Approved by: https://github.com/jgong5, https://github.com/eellison
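To illustrate the mismatch, here is a rough sketch of the pattern shape with and without the extra clone. These are not the exact patterns registered in inductor, and the helper names are made up for illustration:

```python
import torch

def sfdp_like_pattern(query, key, value, inv_scale):
    # Rough shape of a fused_attention pattern: matmul -> softmax -> matmul.
    attn = torch.matmul(query, key.transpose(-2, -1)).div(inv_scale).softmax(dim=-1)
    return torch.matmul(attn, value)

def sfdp_like_pattern_with_clone(query, key, value, inv_scale):
    # Same computation, but with the extra clone() that the aten.dropout
    # decomp emits in eval mode (or when p=0); #108141 registers
    # clone-containing variants so SDPA matching still fires.
    attn = torch.matmul(query, key.transpose(-2, -1)).div(inv_scale).softmax(dim=-1).clone()
    return torch.matmul(attn, value)
```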
Summary: #106274 added a decomp for dropout such that for train mode we get `aten.native_dropout`, while for eval mode we get `aten.clone`. This commit makes it such that we get `aten.native_dropout` for both train and eval modes; for eval mode, we let the cloning happen in the aten op itself. The main motivation behind this change is QAT, which needs to swap between `aten.native_dropout(train=True)` and `aten.native_dropout(train=False)` in the graph. This was previously difficult to do since there was no dropout op to match and replace in eval mode.
Test Plan: python test/test_ops.py
Reviewers: SherlockNoMad, bdhirsh
Subscribers: SherlockNoMad, bdhirsh, supriyar
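For context, the kind of graph rewrite QAT needs becomes straightforward once `aten.native_dropout` is present in both modes. A minimal sketch (assumed helper, not the actual QAT pass):

```python
import torch
from torch.fx import GraphModule

def set_dropout_train_flag(gm: GraphModule, train: bool) -> GraphModule:
    # Flip the `train` argument of every aten.native_dropout call in the
    # graph: native_dropout(input, p, train) -> (output, mask).
    for node in gm.graph.nodes:
        if node.op == "call_function" and node.target == torch.ops.aten.native_dropout.default:
            args = list(node.args)
            args[2] = train  # third positional arg is the train flag
            node.args = tuple(args)
    gm.recompile()
    return gm
```

This only works if eval-mode graphs also contain a `native_dropout` node, which is exactly what this change provides.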
When exporting dropout with a CPU tensor, we get the following graph module:
In addition, if we export in eval() mode, we get an empty graph.
However, when exporting with a CUDA tensor, we get:
and exporting under eval() mode will still have a dropout node in the graph.
This PR makes exporting with a CPU tensor also produce aten.native_dropout.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov
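A minimal repro sketch of the train/eval tracing difference described above (assumed repro path via make_fx; the PR itself goes through export, and exactly which aten ops appear depends on device and on which decompositions are applied):

```python
import torch
from torch.fx.experimental.proxy_tensor import make_fx

def fn_train(x):
    return torch.nn.functional.dropout(x, p=0.5, training=True)

def fn_eval(x):
    return torch.nn.functional.dropout(x, p=0.5, training=False)

x = torch.randn(4, 4)  # CPU tensor
print(make_fx(fn_train)(x).graph)  # with this PR, CPU tracing should also surface aten.native_dropout
print(make_fx(fn_eval)(x).graph)   # eval mode previously traced to an empty graph
```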