[MPS][TYPE_PROMOTION] Fix Clamp #130226

qqaatw · 2024-07-08T01:05:47Z

Stack from ghstack (oldest at bottom):

-> [MPS][TYPE_PROMOTION] Fix Clamp #130226

Summary:

Fixed tensor.clamp produces wrong values on MPS #130201 by adding type promotion.
Added proper tests.
Found torch's type promotion is different from numpy as follows:

import torch
import numpy as np
np.clip(np.array([1], dtype=np.float32), np.array([1], dtype=np.int32), None).dtype  # dtype('float64')
torch.clamp(torch.tensor([1], dtype=torch.float32), torch.tensor([1], dtype=torch.int32)).dtype  # torch.float32

~~Not sure the proper way to handle it, it causes numpy ref tests to fail.~~
Reason here, so think I'm gonna xfail it:

pytorch/test/test_ops.py

Lines 260 to 264 in 3c1cf03

    
           # Tests that the function and its (ndarray-accepting) reference produce the same 
        
           #   values on the tensors from sample_inputs func for the corresponding op. 
        
           # This test runs in double and complex double precision because 
        
           # NumPy does computation internally using double precision for many functions 
        
           # resulting in possible equality check failures.

[ghstack-poisoned]

pytorch-bot · 2024-07-08T01:05:50Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/130226

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 573a0ad with merge base f85bda8 ():

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 1, 5, linux.g5.4xlarge.nvidia.gpu) (gh) (trunk failure)
inductor/test_flex_attention.py::TestFlexAttention::test_fw_bw_graph_correctness
trunk / linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, linux.g5.4xlarge.nvidia.gpu) (gh) (trunk failure)
inductor/test_flex_attention.py::TestFlexAttention::test_fw_bw_graph_correctness

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

ghstack-source-id: cb295f4 Pull Request resolved: #130226

malfet · 2024-07-10T01:45:04Z

aten/src/ATen/native/mps/operations/TensorCompare.mm

-                         shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(input_t.scalar_type()))];
+        newCachedGraph->minTensor =
+            [mpsGraph constantWithScalar:min_scalar
+                                   shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(result_type))];


Suggested change

shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(result_type))];

shape:mps::getMPSShape(input_t)

dataType:mps::getMPSScalarType(result_type)];

malfet · 2024-07-10T01:45:23Z

aten/src/ATen/native/mps/operations/TensorCompare.mm

-                         shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(input_t.scalar_type()))];
+        newCachedGraph->maxTensor =
+            [mpsGraph constantWithScalar:max_scalar
+                                   shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(result_type))];


Suggested change

shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(result_type))];

shape:mps::getMPSShape(input_t)

dataType:mps::getMPSScalarType(result_type)];

malfet · 2024-07-10T01:46:27Z

@pytorchbot merge -i

pytorchmergebot · 2024-07-10T01:48:12Z

Merge started

Your change will be merged while ignoring the following 2 checks: trunk / win-vs2019-cpu-py3 / test (default, 1, 3, windows.4xlarge.nonephemeral), trunk / win-vs2019-cpu-py3 / test (default, 3, 3, windows.4xlarge.nonephemeral)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-07-10T01:53:35Z

Merge failed

Reason: Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x 1cd115ccd5eee01fd411865534ed9a9fa0983f48 returned non-zero exit code 1

Auto-merging test/test_mps.py
CONFLICT (content): Merge conflict in test/test_mps.py
Auto-merging torch/testing/_internal/common_methods_invocations.py
error: could not apply 1cd115ccd5... [MPS][TYPE_PROMOTION] Fix Clamp
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".
hint: Disable this message with "git config advice.mergeConflict false"

Details for Dev Infra team

Raised by workflow job

[ghstack-poisoned]

ghstack-source-id: 8ef1251 Pull Request resolved: #130226

qqaatw · 2024-07-10T02:06:29Z

@pytorchbot merge

pytorchmergebot · 2024-07-10T02:08:02Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-07-10T08:06:56Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

malfet · 2024-07-10T14:25:45Z

@pytorchbot merge -f "Lint + MPS is green"

pytorchmergebot · 2024-07-10T14:27:32Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Summary: 1. Fixed pytorch#130201 by adding type promotion. 2. Added proper tests. 3. Found torch's type promotion is different from numpy as follows: ```python import torch import numpy as np np.clip(np.array([1], dtype=np.float32), np.array([1], dtype=np.int32), None).dtype # dtype('float64') torch.clamp(torch.tensor([1], dtype=torch.float32), torch.tensor([1], dtype=torch.int32)).dtype # torch.float32 ``` ~Not sure the proper way to handle it, it causes numpy ref tests to fail.~ Reason here, so think I'm gonna xfail it: https://github.com/pytorch/pytorch/blob/3c1cf03fde145bdbe1f5ffb81765d076c10b4c04/test/test_ops.py#L260-L264 Pull Request resolved: pytorch#130226 Approved by: https://github.com/malfet

qqaatw · 2024-08-11T04:32:40Z

@malfet can we cherry-pick this PR to v2.4.1. People are reporting issues: #133100

qqaatw · 2024-08-12T19:49:28Z

@pytorchbot help

pytorch-bot · 2024-08-12T19:49:31Z

❌ 🤖 pytorchbot command failed:

@pytorchbot: error: argument command: invalid choice: 'help' (choose from 'merge', 'revert', 'rebase', 'label', 'drci', 'cherry-pick', 'close')

usage: @pytorchbot [-h] {merge,revert,rebase,label,drci,cherry-pick,close} ...

Try @pytorchbot --help for more info.

qqaatw · 2024-08-12T19:49:49Z

@pytorchbot -h cherry-pick

pytorch-bot · 2024-08-12T19:49:51Z

❌ 🤖 pytorchbot command failed:

@pytorchbot cherry-pick: error: the following arguments are required: --onto, -c/--classification

usage: @pytorchbot cherry-pick --onto ONTO [--fixes FIXES] -c
                               {regression,critical,fixnewfeature,docs,release}

Try @pytorchbot --help for more info.

qqaatw · 2024-08-12T19:52:35Z

@pytorchbot --help

pytorch-bot · 2024-08-12T19:52:37Z

PyTorchBot Help

usage: @pytorchbot [-h] {merge,revert,rebase,label,drci,cherry-pick,close} ...

In order to invoke the bot on your PR, include a line that starts with
@pytorchbot anywhere in a comment. That line will form the command; no
multi-line commands are allowed. Some commands may be used on issues as specified below.

Example:
    Some extra context, blah blah, wow this PR looks awesome

    @pytorchbot merge

optional arguments:
  -h, --help            Show this help message and exit.

command:
  {merge,revert,rebase,label,drci,cherry-pick,close}
    merge               Merge a PR
    revert              Revert a PR
    rebase              Rebase a PR
    label               Add label to a PR
    drci                Update Dr. CI
    cherry-pick         Cherry pick a PR onto a release branch
    close               Close a PR

Merge

usage: @pytorchbot merge [-f MESSAGE | -i] [-ic] [-r [{viable/strict,main}]]

Merge an accepted PR, subject to the rules in .github/merge_rules.json.
By default, this will wait for all required checks (lint, pull) to succeed before merging.

optional arguments:
  -f MESSAGE, --force MESSAGE
                        Merge without checking anything. This requires a reason for auditting purpose, for example:
                        @pytorchbot merge -f 'Minor update to fix lint. Expecting all PR tests to pass'
                        
                        Please use `-f` as last resort, prefer `--ignore-current` to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.
  -i, --ignore-current  Merge while ignoring the currently failing jobs.  Behaves like -f if there are no pending jobs.
  -ic                   Old flag for --ignore-current. Deprecated in favor of -i.
  -r [{viable/strict,main}], --rebase [{viable/strict,main}]
                        Rebase the PR to re run checks before merging.  Accepts viable/strict or main as branch options and will default to viable/strict if not specified.

Revert

usage: @pytorchbot revert -m MESSAGE -c
                          {nosignal,ignoredsignal,landrace,weird,ghfirst}

Revert a merged PR. This requires that you are a Meta employee.

Example:
  @pytorchbot revert -m="This is breaking tests on trunk. hud.pytorch.org/" -c=nosignal

optional arguments:
  -m MESSAGE, --message MESSAGE
                        The reason you are reverting, will be put in the commit message. Must be longer than 3 words.
  -c {nosignal,ignoredsignal,landrace,weird,ghfirst}, --classification {nosignal,ignoredsignal,landrace,weird,ghfirst}
                        A machine-friendly classification of the revert reason.

Rebase

usage: @pytorchbot rebase [-s | -b BRANCH]

Rebase a PR. Rebasing defaults to the stable viable/strict branch of pytorch.
Repeat contributor may use this command to rebase their PR.

optional arguments:
  -s, --stable          [DEPRECATED] Rebase onto viable/strict
  -b BRANCH, --branch BRANCH
                        Branch you would like to rebase to

Label

usage: @pytorchbot label labels [labels ...]

Adds label to a PR or Issue [Can be used on Issues]

positional arguments:
  labels  Labels to add to given Pull Request or Issue [Can be used on Issues]

Dr CI

usage: @pytorchbot drci 

Update Dr. CI. Updates the Dr. CI comment on the PR in case it's gotten out of sync with actual CI results.

cherry-pick

usage: @pytorchbot cherry-pick --onto ONTO [--fixes FIXES] -c
                               {regression,critical,fixnewfeature,docs,release}

Cherry pick a pull request onto a release branch for inclusion in a release

optional arguments:
  --onto ONTO           Branch you would like to cherry pick onto (Example: release/2.1)
  --fixes FIXES         Link to the issue that your PR fixes (Example: https://github.com/pytorch/pytorch/issues/110666)
  -c {regression,critical,fixnewfeature,docs,release}, --classification {regression,critical,fixnewfeature,docs,release}
                        A machine-friendly classification of the cherry-pick reason.

Close

usage: @pytorchbot close

Close a PR [Can be used on issues]

qqaatw · 2024-08-12T19:54:14Z

@pytorchbot cherry-pick --onto release/2.4 -c critical

Summary: 1. Fixed #130201 by adding type promotion. 2. Added proper tests. 3. Found torch's type promotion is different from numpy as follows: ```python import torch import numpy as np np.clip(np.array([1], dtype=np.float32), np.array([1], dtype=np.int32), None).dtype # dtype('float64') torch.clamp(torch.tensor([1], dtype=torch.float32), torch.tensor([1], dtype=torch.int32)).dtype # torch.float32 ``` ~Not sure the proper way to handle it, it causes numpy ref tests to fail.~ Reason here, so think I'm gonna xfail it: https://github.com/pytorch/pytorch/blob/3c1cf03fde145bdbe1f5ffb81765d076c10b4c04/test/test_ops.py#L260-L264 Pull Request resolved: #130226 Approved by: https://github.com/malfet (cherry picked from commit 99967e1)

pytorchbot · 2024-08-12T19:58:22Z

Cherry picking #130226

The cherry pick PR is at #133260 and it is recommended to link a critical cherry pick PR with an issue. The following tracker issues are updated:

[v2.4.1] Release Tracker #132400 (comment)

Details for Dev Infra team

Raised by workflow job

[MPS][TYPE_PROMOTION] Fix Clamp (#130226) Summary: 1. Fixed #130201 by adding type promotion. 2. Added proper tests. 3. Found torch's type promotion is different from numpy as follows: ```python import torch import numpy as np np.clip(np.array([1], dtype=np.float32), np.array([1], dtype=np.int32), None).dtype # dtype('float64') torch.clamp(torch.tensor([1], dtype=torch.float32), torch.tensor([1], dtype=torch.int32)).dtype # torch.float32 ``` ~Not sure the proper way to handle it, it causes numpy ref tests to fail.~ Reason here, so think I'm gonna xfail it: https://github.com/pytorch/pytorch/blob/3c1cf03fde145bdbe1f5ffb81765d076c10b4c04/test/test_ops.py#L260-L264 Pull Request resolved: #130226 Approved by: https://github.com/malfet (cherry picked from commit 99967e1) Co-authored-by: Li-Huai (Allan) Lin <qqaatw@gmail.com>

[MPS][TYPE_PROMOTION] Fix Clamp (pytorch#130226) Summary: 1. Fixed pytorch#130201 by adding type promotion. 2. Added proper tests. 3. Found torch's type promotion is different from numpy as follows: ```python import torch import numpy as np np.clip(np.array([1], dtype=np.float32), np.array([1], dtype=np.int32), None).dtype # dtype('float64') torch.clamp(torch.tensor([1], dtype=torch.float32), torch.tensor([1], dtype=torch.int32)).dtype # torch.float32 ``` ~Not sure the proper way to handle it, it causes numpy ref tests to fail.~ Reason here, so think I'm gonna xfail it: https://github.com/pytorch/pytorch/blob/3c1cf03fde145bdbe1f5ffb81765d076c10b4c04/test/test_ops.py#L260-L264 Pull Request resolved: pytorch#130226 Approved by: https://github.com/malfet (cherry picked from commit 99967e1) Co-authored-by: Li-Huai (Allan) Lin <qqaatw@gmail.com>

Update

cd02b7e

[ghstack-poisoned]

qqaatw requested review from kulinseth, malfet and mruberry as code owners July 8, 2024 01:05

pytorch-bot bot added ciflow/mps Run MPS tests (subset of trunk) release notes: mps Release notes category labels Jul 8, 2024

qqaatw requested a review from albanD July 8, 2024 01:09

pytorchbot added the open source label Jul 8, 2024

Update

1605a94

[ghstack-poisoned]

qqaatw added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 8, 2024

Update

a2cdc36

[ghstack-poisoned]

Update

95a1fb8

[ghstack-poisoned]

qqaatw added a commit that referenced this pull request Jul 8, 2024

[MPS][TYPE_PROMOTION] Fix Clamp

1cd115c

ghstack-source-id: cb295f4 Pull Request resolved: #130226

malfet approved these changes Jul 10, 2024

View reviewed changes

malfet added the topic: bug fixes topic category label Jul 10, 2024

pytorchmergebot added the merging label Jul 10, 2024

pytorchmergebot removed the merging label Jul 10, 2024

Update

ad65560

[ghstack-poisoned]

Update

573a0ad

[ghstack-poisoned]

qqaatw added a commit that referenced this pull request Jul 10, 2024

[MPS][TYPE_PROMOTION] Fix Clamp

4043c20

ghstack-source-id: 8ef1251 Pull Request resolved: #130226

pytorchmergebot added the merging label Jul 10, 2024

pytorchmergebot added the Merged label Jul 10, 2024

pytorchmergebot closed this in 99967e1 Jul 10, 2024

pytorchmergebot removed the merging label Jul 10, 2024

github-actions bot deleted the gh/qqaatw/29/head branch August 10, 2024 01:58

qqaatw mentioned this pull request Aug 11, 2024

MPS clamp_min on older MacOS #133100

Open

pytorchbot mentioned this pull request Aug 12, 2024

[MPS][TYPE_PROMOTION] Fix Clamp #133260

Merged

pytorchbot mentioned this pull request Aug 12, 2024

[v2.4.1] Release Tracker #132400

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MPS][TYPE_PROMOTION] Fix Clamp #130226

[MPS][TYPE_PROMOTION] Fix Clamp #130226

	# Tests that the function and its (ndarray-accepting) reference produce the same
	# values on the tensors from sample_inputs func for the corresponding op.
	# This test runs in double and complex double precision because
	# NumPy does computation internally using double precision for many functions
	# resulting in possible equality check failures.

	shape:(mps::getMPSShape(input_t))dataType:(mps::getMPSScalarType(result_type))];
	shape:mps::getMPSShape(input_t)
	dataType:mps::getMPSScalarType(result_type)];

[MPS][TYPE_PROMOTION] Fix Clamp #130226

[MPS][TYPE_PROMOTION] Fix Clamp #130226

Conversation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/130226

✅ You can merge normally! (2 Unrelated Failures)

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Merge started

Merge failed

Merge started

Merge started

PyTorchBot Help

Merge

Revert

Rebase

Label

Dr CI

cherry-pick

Close

Cherry picking #130226