-
-
Notifications
You must be signed in to change notification settings - Fork 10.9k
ENH: Improve the performance of einsum by using universal simd #17049
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from all commits
Commits
Show all changes
40 commits
Select commit
Hold shift + click to select a range
e26dcf7
new npyv intrinsics
Qiyu8 47118fb
einsum dispatch and usimd process
Qiyu8 ad0b3b4
update
Qiyu8 55200fc
add float32 benchmark case
Qiyu8 94cff77
Merge branch 'master' of github.com:numpy/numpy into einsum-usimd
Qiyu8 4d7d94d
update
Qiyu8 ae53e35
fix typos
Qiyu8 2e713b0
add avx512 reduce sum comments
Qiyu8 5e7cbd1
add non_contigous arrays ,improve reduce the sum
Qiyu8 80c0ed4
Merge branch 'master' of github.com:numpy/numpy into einsum-usimd
Qiyu8 9060231
rebase after split for a better review
Qiyu8 b0375dc
Merge branch 'einsum-usimd' of github.com:Qiyu8/numpy into einsum-usimd
Qiyu8 1990c13
headers reconstruct
Qiyu8 7b756af
use for loop replace begin repeat for readability
Qiyu8 4877e40
add ivdeps and handle header dependency
Qiyu8 168c6c9
Merge branch 'einsum-usimd' of github.com:Qiyu8/numpy into einsum-usimd
Qiyu8 954e642
revert to faster simd code
Qiyu8 50c6b7e
changed to baseline solution
Qiyu8 23e28c0
remove redundant typedef
Qiyu8 21f1c0b
update
Qiyu8 a07455a
Merge branch 'master' of github.com:numpy/numpy into einsum-usimd
Qiyu8 d298c8e
remove redundant intrinsics
Qiyu8 6dac52e
add blank lines
Qiyu8 985e5b2
add format
Qiyu8 88c2747
Update numpy/core/src/common/simd/avx512/arithmetic.h
Qiyu8 90026f9
Merge branch 'master' of github.com:numpy/numpy into einsum-usimd
Qiyu8 54943e0
modify the int to npy_intp
Qiyu8 e993af2
split benchmark and define common macro
Qiyu8 38f7382
avx2 test
Qiyu8 f351665
Merge branch 'einsum-usimd' of github.com:Qiyu8/numpy into einsum-usimd
Qiyu8 c6c1e30
explain for auto-vectorize part
Qiyu8 f18ade4
add explantion
Qiyu8 33b7d2a
remove duplicated message
Qiyu8 5a692ed
Update benchmarks/benchmarks/bench_linalg.py
Qiyu8 20d5cda
Update numpy/core/src/multiarray/einsum_sumprod.c.src
Qiyu8 83734bf
Merge branch 'master' of github.com:numpy/numpy into einsum-usimd
Qiyu8 f8f7482
Merge branch 'einsum-usimd' of github.com:Qiyu8/numpy into einsum-usimd
Qiyu8 1889738
Merge branch 'master' of github.com:numpy/numpy into einsum-usimd
Qiyu8 7ff7324
fix typos
Qiyu8 73f61c3
remove extra test
Qiyu8 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.