[inductor] [cpp] Support vectorization for score and mask in FlexAttention CPU #143638
Conversation
See artifacts and rendered test results at hud.pytorch.org/pr/143638.
As of commit f7b901d with merge base 66fb10f: ✅ you can merge normally (3 unrelated failures: broken-trunk jobs that also failed on the merge base, plus one job marked unstable due to flakiness on trunk).
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Hi @drisspg, could you please review this PR when you get time?
```python
layout=FixedLayout(
    device,
    dtype,
    size if size else [],
```
Since we create the subgraphs using scalars, we don't have any real size or stride information. Why this change?
Before this PR, we created the subgraphs using scalars. In this PR, in order to generate vectorized code, we create subgraphs using tensors, and we therefore changed this function:
https://github.com/pytorch/pytorch/blob/gh/chunyuan-w/3/head/torch/_inductor/kernel/flex_attention.py#L918-L922
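A minimal sketch of the distinction under discussion (the `make_placeholder` helper is hypothetical; the real change is in the `FixedLayout` construction linked above):

```python
import torch

# Hypothetical helper mirroring the placeholder-creation change discussed
# above; the actual code builds an Inductor FixedLayout, not a tensor.
def make_placeholder(dtype, size=None):
    # Before this PR: subgraph inputs were always 0-dim scalars (size == []).
    # With this PR: a real size can be passed through so the generated
    # kernel can operate on vectors instead of one scalar at a time.
    return torch.empty(size if size else [], dtype=dtype)

scalar_ph = make_placeholder(torch.float32)        # shape: []
vector_ph = make_placeholder(torch.float32, [16])  # shape: [16]
```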
Hi @drisspg, thanks for your reply. I have addressed all the comments. Could you please re-review to see whether it's good to land or any other changes are needed?
Sorry, I went on PTO for a few days and thought I had already approved. Thanks for the ping.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours).
Stack from ghstack (oldest at bottom):
Description
This PR generates a vectorized kernel for the score and mask subgraphs in FlexAttention on CPU.
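For context, FlexAttention customizes attention through small user-defined callables; the functions below are illustrative examples (not taken from this PR) of the kind of score and mask subgraphs being vectorized:

```python
import torch

# Illustrative FlexAttention callables; these specific functions are
# examples of the pattern, not code from the PR.

def rel_bias_score(score, b, h, q_idx, kv_idx):
    # score_mod: elementwise transform applied to each attention score.
    return score + (q_idx - kv_idx)

def causal_mask(b, h, q_idx, kv_idx):
    # mask_mod: keep only key positions at or before the query position.
    return q_idx >= kv_idx
```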
Modification
The main changes include:
The original mask graph:
Benchmark
For `q`, `k`, `v` of shape `[1, 32, 1024, 128]`, using 40 CPU cores, we observe over 20x speedup compared with the non-vectorized version for both `is_causal=False` and `is_causal=True`.
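The causal behavior benchmarked above can be expressed as a `score_mod`; the snippet below (names of my choosing, and a tiny grid instead of the benchmark's sequence length) only illustrates the masking semantics, not the benchmark harness itself:

```python
import torch

# Sketch of a causal score_mod of the kind benchmarked above. The PR
# measures full FlexAttention runs on [1, 32, 1024, 128] inputs; here a
# tiny 4x4 score grid is enough to show what the subgraph computes.

def causal_score_mod(score, b, h, q_idx, kv_idx):
    # Mimics is_causal=True: future key positions get -inf before softmax.
    return torch.where(q_idx >= kv_idx, score, torch.tensor(float("-inf")))

S = 4
q_idx = torch.arange(S).view(S, 1)   # broadcasts over rows (queries)
kv_idx = torch.arange(S).view(1, S)  # broadcasts over columns (keys)
masked = causal_score_mod(torch.zeros(S, S), 0, 0, q_idx, kv_idx)
# masked now has -inf strictly above the diagonal.
```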
Test plan

The existing FlexAttention UTs (`test/inductor/test_flex_attention.py`, `test/inductor/test_flex_decoding.py`) cover the change in this PR.

Output code
Code before this PR was scalar:
Code after this PR is vectorized:
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov