[Inductor] Pick ISA for inductor based on ATEN_CPU_CAPABILITY #123514

CaoE · 2024-04-07T03:26:15Z

Stack from ghstack (oldest at bottom):

-> [Inductor] Pick ISA for inductor based on ATEN_CPU_CAPABILITY #123514

This is part of #123224. Pick the ISA based on the ATEN_CPU_CAPABILITY environment variable so that the CPU vector ISA level used by Inductor can be controlled the same way as in eager mode.
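The idea in the description can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the function name `pick_isa`, the `_LEVELS` ordering, and the choice to clamp the request to what the machine supports are all assumptions made for the example.

```python
import os
from typing import Mapping, Optional, Sequence

# Assumed ordering of x86 capability names, narrowest to widest.
# These names are illustrative; they are not Inductor's internal tables.
_LEVELS = ("default", "avx2", "avx512")


def pick_isa(supported: Sequence[str],
             env: Optional[Mapping[str, str]] = None) -> str:
    """Pick the widest supported level, capped by ATEN_CPU_CAPABILITY if set.

    `supported` lists the levels the (hypothetical) machine can run.
    "default" is always treated as available.
    """
    env = os.environ if env is None else env
    usable = [name for name in _LEVELS
              if name == "default" or name in supported]

    requested = env.get("ATEN_CPU_CAPABILITY", "").strip().lower()
    if requested in _LEVELS:
        cap = _LEVELS.index(requested)
        # Drop every level wider than the requested one.
        usable = [name for name in usable if _LEVELS.index(name) <= cap]

    # An unset or unrecognized value leaves the list untouched,
    # so the widest supported level wins.
    return usable[-1]
```

With this sketch, setting the variable to a narrower level than the hardware supports lowers the choice, while an unset or unknown value falls back to the widest supported level.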

cc @ezyang @chauhang @penguinwu @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @rec @msaroufim @bdhirsh @anijain2305 @peterbell10 @aakhundov

[ghstack-poisoned]

pytorch-bot · 2024-04-07T03:26:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/123514

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 38ec5c4 with merge base 67883e7:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

periodic / win-vs2019-cuda11.8-py3 / test (default, 1, 4, windows.g5.4xlarge.nvidia.gpu) (gh) (disabled by #137936)
test_linalg.py::TestLinalgCUDA::test_matmul_offline_tunableop_cuda_float16

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

ghstack-source-id: 75ae45b Pull Request resolved: #123514

jgong5 · 2024-04-08T01:44:09Z

torch/_inductor/codecache.py

+        and not isinstance(_valid_vec_isa_list[0], VecNEON)
+        and not isinstance(_valid_vec_isa_list[0], VecZVECTOR)


compute_cpu_capability also handles vsx and zvector. Should we also consider them here?

Taking VecNEON and VecZVECTOR into account.

jgong5 · 2024-04-08T01:47:50Z

aten/src/ATen/native/DispatchStub.cpp

-  static CPUCapability capability = compute_cpu_capability();
+  CPUCapability capability = compute_cpu_capability();


Why remove static here?

Changed it back
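For readers unfamiliar with the distinction raised in this thread: a function-local `static` in C++ is initialized once and reused, so the capability is computed a single time per process. The sketch below is a Python analogy only, not the C++ code from DispatchStub.cpp; both function names are hypothetical.

```python
import functools
import os


def read_capability() -> str:
    """Re-read the environment on every call (like a plain local variable)."""
    return os.environ.get("ATEN_CPU_CAPABILITY", "default")


@functools.lru_cache(maxsize=None)
def read_capability_cached() -> str:
    """Read once and reuse the result (roughly what a local static does)."""
    return os.environ.get("ATEN_CPU_CAPABILITY", "default")
```

The practical consequence is that, with the cached variant, changing the environment variable after the first call has no effect on later calls in the same process.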

jgong5 · 2024-04-08T01:49:10Z

torch/_inductor/codecache.py

    # If simdlen is None, determine the vectorization length automatically
    if config.cpp.simdlen is None:
        assert _valid_vec_isa_list


Perhaps we don't need this check any longer.

Removed the check

[ghstack-poisoned]

ghstack-source-id: 8b0e45e Pull Request resolved: #123514

pytorchmergebot · 2024-09-30T00:47:47Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

huydhn · 2024-09-30T15:45:14Z

@pytorchbot revert -m 'Sorry for reverting your change but its test_cpu_repro test is failing in trunk https://hud.pytorch.org/pytorch/pytorch/commit/6931c1644afdba53e63ce5671455e4e1b7265dd9' -c nosignal

I think a rebase is needed

pytorchmergebot · 2024-09-30T15:46:54Z

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot · 2024-09-30T15:47:07Z

@CaoE your PR has been successfully reverted.

…#123514)" This reverts commit 6931c16. Reverted #123514 on behalf of https://github.com/huydhn due to Sorry for reverting your change but its test_cpu_repro test is failing in trunk https://hud.pytorch.org/pytorch/pytorch/commit/6931c1644afdba53e63ce5671455e4e1b7265dd9 ([comment](#123514 (comment)))

[ghstack-poisoned]

ghstack-source-id: 817e586 Pull Request resolved: #123514

[ghstack-poisoned]

ghstack-source-id: 69b6b5b Pull Request resolved: #123514

github-actions

Please commit the suggested changes from pytorch's linter.

test/inductor/test_cpu_repro.py

[ghstack-poisoned]

ghstack-source-id: 11fa280 Pull Request resolved: #123514

[ghstack-poisoned]

ghstack-source-id: 00be4ed Pull Request resolved: #123514

CaoE · 2024-10-13T11:24:18Z

@huydhn The PR is rebased. Could you please help import this PR to see if it will break internal checks?

huydhn · 2024-10-14T10:17:28Z

Umm, unfortunately, I couldn't import this PR because it uses ghstack. Only the stack owner can do so (folks from Meta do that themselves). There is no workaround that I know of atm, so I think let's just merge this and let our oncall check it later.

CaoE · 2024-10-17T08:58:55Z

@pytorchbot merge

pytorchmergebot · 2024-10-17T09:01:29Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

Update

3996e44

[ghstack-poisoned]

pytorch-bot bot added ciflow/inductor module: inductor labels Apr 7, 2024

CaoE marked this pull request as draft April 7, 2024 03:26

CaoE changed the title ~~Set sisdlen according to _get_cpu_capability~~ Set simdlen according to _get_cpu_capability Apr 7, 2024

CaoE added ciflow/trunk Trigger trunk jobs on your pull request ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR labels Apr 7, 2024

pytorchbot added the open source label Apr 7, 2024

Update

6da1626

[ghstack-poisoned]

Update

ab7dae3

[ghstack-poisoned]

CaoE added a commit that referenced this pull request Apr 7, 2024

Set simdlen according to _get_cpu_capability

3a172d9

ghstack-source-id: 75ae45b Pull Request resolved: #123514

CaoE requested a review from jgong5 April 7, 2024 12:09

jgong5 requested changes Apr 8, 2024

Update

4d64aef

[ghstack-poisoned]

Update

8313e6a

[ghstack-poisoned]

Update

1ab6aa5

[ghstack-poisoned]

Update

172a07a

[ghstack-poisoned]

Update

14d97dc

[ghstack-poisoned]

CaoE added a commit that referenced this pull request Apr 9, 2024

Set simdlen according to _get_cpu_capability

9effc41

ghstack-source-id: 8b0e45e Pull Request resolved: #123514

CaoE changed the title ~~Set simdlen according to _get_cpu_capability~~ Set simdlen based on the environment ATEN_CPU_CAPABILITY Apr 9, 2024

CaoE changed the title ~~Set simdlen based on the environment ATEN_CPU_CAPABILITY~~ Set simdlen based on ATEN_CPU_CAPABILITY Apr 9, 2024

pytorchmergebot added the merging label Sep 30, 2024

pytorchmergebot closed this in 6931c16 Sep 30, 2024

pytorchmergebot removed the merging label Sep 30, 2024

abhishek-iitmadras mentioned this pull request Sep 30, 2024

Extend vectorization with SVE(ARM) with Torch Compile (Inductor) #134672

Closed

pytorchmergebot reopened this Sep 30, 2024

Update

53d3291

[ghstack-poisoned]

CaoE added a commit that referenced this pull request Oct 9, 2024

Set simdlen based on ATEN_CPU_CAPABILITY

8490944

ghstack-source-id: 817e586 Pull Request resolved: #123514

Update

b5c77c0

[ghstack-poisoned]

CaoE added a commit that referenced this pull request Oct 11, 2024

Set simdlen based on ATEN_CPU_CAPABILITY

5dee168

ghstack-source-id: 69b6b5b Pull Request resolved: #123514

github-actions bot requested changes Oct 11, 2024

test/inductor/test_cpu_repro.py (outdated)

Update

085b4cd

[ghstack-poisoned]

CaoE added a commit that referenced this pull request Oct 11, 2024

Set simdlen based on ATEN_CPU_CAPABILITY

5c4dc82

ghstack-source-id: 11fa280 Pull Request resolved: #123514

Update

38ec5c4

[ghstack-poisoned]

CaoE added a commit that referenced this pull request Oct 12, 2024

Set simdlen based on ATEN_CPU_CAPABILITY

f02aea3

ghstack-source-id: 00be4ed Pull Request resolved: #123514

pytorchmergebot added the merging label Oct 17, 2024

pytorchmergebot closed this in 8cfe28e Oct 17, 2024

pytorchmergebot removed the merging label Oct 17, 2024

github-actions bot deleted the gh/CaoE/31/head branch November 17, 2024 02:11
