[Inductor] Construct subgraph with benchmarking args not example_inputs #153667
Conversation
Differential Revision: D74259569
CI status as of commit d53847f (merge base 7e16cb9): 2 new failures.
This pull request was exported from Phabricator. Differential Revision: D74484747
Force-pushed 0fb6441 to 2a36eba
Force-pushed 2a36eba to 01f423c
[Inductor] Construct subgraph with benchmarking args not example_inputs (pytorch#153667). Differential Revision: D74484747
Force-pushed 01f423c to d53847f
[Inductor] Construct subgraph with benchmarking args not example_inputs (pytorch#153667). Differential Revision: D74900879
Summary: If the inputs to a subgraph have FlexibleLayout, the subgraph does not currently freeze their layouts. As a result, the `example_inputs` generated for the subgraph may not be consistent in layout with the `args` passed in for benchmarking.

Test Plan:

```python
import torch

torch.set_default_device("cuda")

M, N, K = (4, 128, 14240)

@torch.compile(mode='max-autotune-no-cudagraphs')
def foo(x, y):
    return (x + 1) @ y

inps = [
    torch.rand([M, K], device='cuda', dtype=torch.bfloat16),
    torch.rand([K, N], device='cuda', dtype=torch.bfloat16),
]
foo(*inps)
```
This produces a FlexibleLayout with the stride change. A test was added to `test_subgraph_choice`, and `test_max_autotune.py` was also run.
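To make the layout issue concrete, here is a minimal standalone sketch (not the Inductor code path; the shapes and the use of `torch.empty_strided` are illustrative assumptions): two tensors with the same shape can carry different strides, so a benchmarking input generated without fixing the layout may not match the real argument.

```python
# Minimal sketch, not the Inductor code path: two tensors with the same shape can
# carry different layouts (strides), so inputs generated for benchmarking need to
# match the layout of the real args or the timing may not reflect the real kernel.
import torch

real_arg = torch.rand(128, 4).t()   # shape (4, 128), strides (1, 4): transposed view
example  = torch.rand(4, 128)       # shape (4, 128), strides (128, 1): contiguous

assert real_arg.shape == example.shape
assert real_arg.stride() != example.stride()

# One way (for illustration only) to build a benchmarking tensor whose layout
# matches the real argument exactly:
matched = torch.empty_strided(real_arg.shape, real_arg.stride(), dtype=real_arg.dtype)
assert matched.stride() == real_arg.stride()
```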
Differential Revision: D74484747
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov