Avoid calling fallback directly for symmetric memory tests #153520

fegin · 2025-05-14T05:44:28Z

Stack from ghstack (oldest at bottom):

Since we can just use _test_mode to dispatch the calls, we should use this method to also verify the function signature is consistent.

cc @H-Huang @awgu @wanchaol @fduwjj @wz337 @wconstab @d4l3k

[ghstack-poisoned]

Since we can just use _test_mode to dispatch the calls, we should use this method to also verify the function signature is consistent. ghstack-source-id: 4cd25df Pull-Request-resolved: #153520

pytorch-bot · 2025-05-14T05:44:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153520

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[PREEMPTIVE] Removal of ephemeral variants on scale-config.yml

❌ 1 New Failure

As of commit eff8f67 with merge base a13c8f2 ():

NEW FAILURE - The following job has failed:

Lint / lintrunner-noclang / linux-job (gh)
>>> Lint for test/distributed/test_symmetric_memory.py:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

kwen2501 · 2025-05-14T15:47:19Z

test/distributed/test_symmetric_memory.py

+        ag_output_vec = []
+        mm_outputs_vec = []
+        for context in test_contexts:
+            with context():


nit: it seems a little bit obscure to wrap the test in a loop (and use vector).

Since there is only 1 special context here, how about:

ag_output_0, mm_outputs_0 = torch.ops.symm_mem.fused_all_gather_matmul( A_shard, Bs, gather_dim=gather_dim, group_name=group.group_name ) with _test_mode(): ag_output_1, mm_outputs_1 = torch.ops.symm_mem.fused_all_gather_matmul( A_shard, Bs, gather_dim=gather_dim, group_name=group.group_name ) assert torch.allclose(ag_output_0, ag_output_1)

(Like you did for the other test below)

Update

eff8f67

[ghstack-poisoned]

fegin mentioned this pull request May 14, 2025

Fix AsyncMM not compiled with SM90a issue #153519

Closed

pytorch-bot bot added oncall: distributed Add this issue/PR to distributed oncall triage queue topic: not user facing topic category labels May 14, 2025

fegin requested review from kwen2501 and fduwjj May 14, 2025 05:47

kwen2501 approved these changes May 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid calling fallback directly for symmetric memory tests #153520

Avoid calling fallback directly for symmetric memory tests #153520

Avoid calling fallback directly for symmetric memory tests #153520

Are you sure you want to change the base?

Avoid calling fallback directly for symmetric memory tests #153520

Conversation

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153520

❗ 1 Active SEVs

❌ 1 New Failure

Choose a reason for hiding this comment

Choose a reason for hiding this comment