[SDPA] Add testing to ensure stride order exactly matches #152894

drisspg · 2025-05-06T01:18:53Z

Stack from ghstack (oldest at bottom):

-> [SDPA] Add testing to ensure stride order exactly matches #152894

Currently results

TODO: update the meta registration for mem eff before landing

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

[ghstack-poisoned]

pytorch-bot · 2025-05-06T01:18:56Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152894

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 5663e65 Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: 86b7979 Pull Request resolved: #152894

drisspg · 2025-05-06T01:45:52Z

Update mem eff striding

[ghstack-poisoned]

ghstack-source-id: 9116ce7 Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: 4a2bdbc Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: 6124d74 Pull Request resolved: #152894

drisspg · 2025-05-06T02:38:14Z

test/test_transformers.py

@@ -2469,6 +2469,73 @@ def test_cudnn_attention_different_dk_dv(self, device):

        self.assertEqual(actual.contiguous(), math_ref.contiguous().to(dtype), atol=1e-3, rtol=1e-2)

+    @unittest.skipIf(not PLATFORM_SUPPORTS_FUSED_ATTENTION, "Fused SDPA was not built for this system")
+    @parametrize("backend", PLATFORM_SPECIFIC_SDPA, name_fn=lambda x: x.name)
+    @parametrize("compile_mode", ["eager", "inductor"])


Locally all tests pass, but that doesn't seem possible, since I haven't even updated the meta for mem eff yet.

I wonder if there are too many recompiles and it then just falls back to eager.
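For readers following along, here is a minimal pure-Python sketch of the property the new test is after: the strides reported for the compiled output should be identical to the strides eager produces. It is not the test from this PR. The helper names `stride_order` and `layouts_match` and the example stride tuples are assumptions made up for illustration, and nothing here calls into torch.

```python
def stride_order(strides):
    """Return dimension indices sorted from outermost (largest stride) to innermost.

    Ties are broken by dimension index so the result is deterministic.
    """
    return tuple(sorted(range(len(strides)), key=lambda dim: (-strides[dim], dim)))


def layouts_match(actual_strides, expected_strides):
    """True only when the strides agree exactly, which also implies the same stride order."""
    return tuple(actual_strides) == tuple(expected_strides)


# Hypothetical strides for an output of shape (batch=2, heads=4, seq=8, dim=16).
# Eager output backed by a transposed (batch, seq, heads, dim) buffer:
eager_strides = (512, 16, 64, 1)
# What a meta function that assumes a contiguous output would report:
predicted_strides = (512, 128, 16, 1)

print(stride_order(eager_strides))      # heads and seq swapped relative to contiguous
print(stride_order(predicted_strides))  # plain contiguous order
print(layouts_match(eager_strides, predicted_strides))
```

If a meta function is stale, a comparison like this is expected to fail in the compiled case, which is why a fully green local run looks suspicious.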

eellison · 2025-05-06T13:56:28Z

test/test_transformers.py

+        if compile_mode == "inductor":
+            run_sdpa = torch.compile(run_sdpa, backend="inductor", fullgraph=True)
+
+        with sdpa_kernel(backends=[backend]):


If compile_mode == "eager", can you enable CrossRefFakeMode?

[ghstack-poisoned]

ghstack-source-id: 024b248 Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: 7a94e85 Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: 721d10e Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: a65f99f Pull Request resolved: #152894

torch/_meta_registrations.py

Skylion007 · 2025-05-07T12:54:33Z

aten/src/ATen/native/transformers/cuda/attention_backward.cu

-    grad_q = at::empty(query.sizes(), query.options());
-    grad_k = at::empty(key.sizes(), key.options());
-    grad_v = at::empty(value.sizes(), value.options());
+    grad_q = at::empty_like(query);


Pretty sure flash attention used to have the same bug; I guess it was copied and pasted from here and never fixed here.
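To make the difference in the hunk above concrete, here is a small pure-Python model of the two allocation styles. It is a sketch under stated assumptions, not ATen code: `contiguous_strides`, `empty_strides`, and `empty_like_strides` are hypothetical helpers, and the `empty_like` model assumes the input is dense and non-overlapping, the case in which its strides are preserved.

```python
def contiguous_strides(sizes):
    """Row-major strides for the given sizes (innermost dimension has stride 1)."""
    strides, running = [], 1
    for size in reversed(sizes):
        strides.append(running)
        running *= max(size, 1)  # guard against zero-sized dimensions
    return tuple(reversed(strides))


def empty_strides(sizes):
    """Model of at::empty(sizes, options): a fresh contiguous layout, input strides ignored."""
    return contiguous_strides(sizes)


def empty_like_strides(sizes, strides):
    """Model of at::empty_like(t) for a dense, non-overlapping t: strides are kept as-is."""
    return tuple(strides)


# Hypothetical query of shape (batch=2, heads=4, seq=8, dim=16), obtained by
# transposing a contiguous (batch, seq, heads, dim) tensor.
query_sizes = (2, 4, 8, 16)
query_strides = (512, 16, 64, 1)

print(empty_strides(query_sizes))                      # contiguous, differs from query
print(empty_like_strides(query_sizes, query_strides))  # same layout as query
```

Under this model, a gradient allocated from sizes alone only matches the query's layout when the query happens to be contiguous.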

[ghstack-poisoned]

ghstack-source-id: f31fe7c Pull Request resolved: #152894

[ghstack-poisoned]

ghstack-source-id: 466e931 Pull Request resolved: #152894

Update

2b99b20

[ghstack-poisoned]

pytorch-bot bot added the topic: not user facing topic category label May 6, 2025

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

57a5178

ghstack-source-id: 5663e65 Pull Request resolved: #152894

Update

21735bd

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

3a4b57a

ghstack-source-id: 86b7979 Pull Request resolved: #152894

Update

b8921f4

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

c573b16

ghstack-source-id: 9116ce7 Pull Request resolved: #152894

Update

b9ece2d

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

fe1fbf9

ghstack-source-id: 4a2bdbc Pull Request resolved: #152894

Update

ec78430

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

8a25f71

ghstack-source-id: 6124d74 Pull Request resolved: #152894

drisspg commented May 6, 2025

eellison reviewed May 6, 2025

Update

2a2f9bf

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

5188980

ghstack-source-id: 024b248 Pull Request resolved: #152894

Update

7a7c4d1

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

02f2496

ghstack-source-id: 7a94e85 Pull Request resolved: #152894

Update

c7a42a7

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

07600f7

ghstack-source-id: 721d10e Pull Request resolved: #152894

pytorch-bot bot added ciflow/inductor module: inductor labels May 6, 2025

Update

94e5870

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 6, 2025

[SDPA] Add testing to ensure stride order exactly matches

aad2b09

ghstack-source-id: a65f99f Pull Request resolved: #152894

Skylion007 reviewed May 7, 2025

torch/_meta_registrations.py (outdated)

Skylion007 reviewed May 7, 2025

drisspg mentioned this pull request May 12, 2025

torch.compile causes stride mismatch in SDPA with non-contiguous query in torch 2.7 #152747

Open

Update

988ba0c

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 14, 2025

[SDPA] Add testing to ensure stride order exactly matches

f92ceb0

ghstack-source-id: f31fe7c Pull Request resolved: #152894

drisspg added a commit that referenced this pull request May 14, 2025

[SDPA] Add testing to ensure stride order exactly matches

9072822

ghstack-source-id: f31fe7c Pull Request resolved: #152894

Update

8f1360c

[ghstack-poisoned]

drisspg added a commit that referenced this pull request May 14, 2025

[SDPA] Add testing to ensure stride order exactly matches

f77b627

ghstack-source-id: 466e931 Pull Request resolved: #152894
