
[FlexAttention] Fix device test instantation #151846


Closed
drisspg wants to merge 17 commits

Conversation

drisspg (Contributor) commented Apr 21, 2025

[ghstack-poisoned]
pytorch-bot (bot) commented Apr 21, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/151846

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 5 Pending

As of commit 7703789 with merge base cc793e8:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

drisspg added a commit that referenced this pull request Apr 21, 2025
ghstack-source-id: d7b5533
Pull Request resolved: #151846
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov

[ghstack-poisoned]
drisspg added a commit that referenced this pull request Apr 21, 2025
ghstack-source-id: 21f6341
Pull Request resolved: #151846
[ghstack-poisoned]
drisspg added a commit that referenced this pull request Apr 22, 2025
ghstack-source-id: fa73de9
Pull Request resolved: #151846
drisspg added 3 commits April 21, 2025 18:59
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
B = 4
H = 8
S = 2048
B = 2
drisspg (Contributor, Author), on the reassignment of B above:

purposeful
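For context, constants like these parametrize the query/key/value shapes used throughout the FlexAttention tests. A minimal sketch of how such shape constants typically feed a flex_attention call; the head dimension D, the device, and the score_mod here are illustrative assumptions, not taken from this diff:

import torch
from torch.nn.attention.flex_attention import flex_attention

B, H, S, D = 4, 8, 2048, 64  # batch, heads, sequence length; D is assumed

# flex_attention expects query/key/value in (B, H, S, D) layout.
q, k, v = (torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
           for _ in range(3))

def causal(score, b, h, q_idx, kv_idx):
    # score_mod callback: push future positions to -inf.
    return torch.where(q_idx >= kv_idx, score, float("-inf"))

out = flex_attention(q, k, v, score_mod=causal)  # -> (B, H, S, D)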

drisspg added 3 commits April 21, 2025 21:14
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
Divigroup-RAP pushed a commit to Divigroup-RAP/PYTORCH that referenced this pull request Apr 22, 2025
drisspg added 3 commits April 22, 2025 10:44
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@drisspg drisspg requested a review from BoyuanFeng April 22, 2025 19:23
drisspg added 2 commits April 22, 2025 14:36
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@@ -90,6 +94,18 @@ def temp_float32_matmul_precision(precision: str):
     torch.set_float32_matmul_precision(original_precision)


 def skip_on_cpu(test_func):
     """Decorator to skip tests that are not supported on CPU."""
     decorated_func = skipCPUIf(True, "Not supported on CUDA")(test_func)


Suggested change
-    decorated_func = skipCPUIf(True, "Not supported on CUDA")(test_func)
+    decorated_func = skipCPUIf(True, "Not supported on CPU")(test_func)

drisspg (Contributor, Author)

There are a lot of flaky test failures, so I will land this in a follow-up, but good catch.
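For reference, a self-contained sketch of the helper with the suggested wording applied. skipCPUIf comes from torch.testing._internal.common_device_type; the return statement is an assumption filling in the elided diff lines:

from torch.testing._internal.common_device_type import skipCPUIf

def skip_on_cpu(test_func):
    """Decorator to skip tests that are not supported on CPU."""
    # skipCPUIf(condition, reason) only takes effect on the CPU
    # instantiation of a device-generic test; other devices run as usual.
    decorated_func = skipCPUIf(True, "Not supported on CPU")(test_func)
    return decorated_func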

pytorchmergebot (Collaborator)

Merge failed

Reason: Command git -C /home/runner/work/pytorch/pytorch rebase origin/main returned non-zero exit code 1

Rebasing (1/1)
Auto-merging test/inductor/test_flex_attention.py
CONFLICT (content): Merge conflict in test/inductor/test_flex_attention.py
error: could not apply 1e5673aa024... [FlexAttention] Fix device test instantation (#151846)
hint: Resolve all conflicts manually, mark them as resolved with
hint: "git add/rm <conflicted_files>", then run "git rebase --continue".
hint: You can instead skip this commit: run "git rebase --skip".
hint: To abort and get back to the state before "git rebase", run "git rebase --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Could not apply 1e5673aa024... [FlexAttention] Fix device test instantation (#151846)
Raised by workflow job

[ghstack-poisoned]
drisspg (Contributor, Author) commented Apr 23, 2025

@pytorchbot merge -f "I ran the full CI everything was green and last minute merge conflict"

pytorchmergebot (Collaborator)

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as a last resort and instead consider -i/--ignore-current to continue the merge while ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

jithunnair-amd (Collaborator) commented Apr 23, 2025

@drisspg Looks like this PR broke at least the following test in rocm workflow: inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod5_cuda_float16

https://hud.pytorch.org/hud/pytorch/pytorch/21b0ef520d651ed67f6978ac37c8a8a4093819ee/1?per_page=50&name_filter=rocm%20%2F&mergeEphemeralLF=true

I've added the ciflow/rocm label on this PR to surface the failure.

@pytorchbot revert -c nosignal -m "PR broke rocm workflow"

cc @huydhn
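As background on the test name above: PyTorch's device-generic test classes are stamped out per device and per dtype by instantiate_device_type_tests, presumably the machinery this PR's title refers to, and that is what produces names like TestFlexAttentionCUDA::test_GQA_score_mod5_cuda_float16. A minimal sketch with an illustrative test body (the real suite lives in test/inductor/test_flex_attention.py):

import torch
from torch.testing._internal.common_device_type import (
    dtypes,
    instantiate_device_type_tests,
)
from torch.testing._internal.common_utils import TestCase, run_tests

class TestFlexAttention(TestCase):
    @dtypes(torch.float16)
    def test_GQA(self, device, dtype):
        # Illustrative body only; the real tests compare flex_attention
        # against a reference implementation on GQA-shaped inputs.
        x = torch.ones(2, device=device, dtype=dtype)
        self.assertEqual(x.sum().item(), 2.0)

# Creates TestFlexAttentionCPU, TestFlexAttentionCUDA, ... in this module;
# generated test names carry device and dtype suffixes, e.g.
# TestFlexAttentionCUDA::test_GQA_cuda_float16.
instantiate_device_type_tests(TestFlexAttention, globals())

if __name__ == "__main__":
    run_tests()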

pytorchmergebot (Collaborator)

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot (Collaborator)

Don't want to revert based on edited command

jithunnair-amd (Collaborator)

Trying again, since the edit to the comment seemed to displease the bot:

@drisspg Looks like this PR broke at least the following test in rocm workflow: inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_score_mod5_cuda_float16

https://hud.pytorch.org/hud/pytorch/pytorch/21b0ef520d651ed67f6978ac37c8a8a4093819ee/1?per_page=50&name_filter=rocm%20%2F&mergeEphemeralLF=true

I've added the ciflow/rocm label on this PR to surface the failure.

@pytorchbot revert -c nosignal -m "PR broke rocm workflow"

cc @huydhn

jithunnair-amd added the ciflow/rocm label Apr 23, 2025
pytorchmergebot (Collaborator)

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request Apr 23, 2025
This reverts commit b37fa20.

Reverted #151846 on behalf of https://github.com/jithunnair-amd due to "PR broke rocm workflow"
pytorchmergebot (Collaborator)

@drisspg your PR has been successfully reverted.

pytorchmergebot added the Reverted and ci-no-td labels Apr 23, 2025
drisspg (Contributor, Author) commented Apr 23, 2025

@jithunnair-amd what is the failure?

[ghstack-poisoned]
pytorchmergebot (Collaborator)

Starting merge as part of PR stack under #151959

pytorchmergebot pushed a commit that referenced this pull request Apr 24, 2025
wangkuiyi pushed a commit to wangkuiyi/pytorch that referenced this pull request Apr 25, 2025
Labels
ci-no-td, ciflow/inductor, ciflow/rocm, ciflow/trunk, Merged, module: inductor, Reverted, topic: not user facing
7 participants