-
Notifications
You must be signed in to change notification settings - Fork 12.5k
Arm AArch64: optimized GEMV and GEMM kernels for q4_0_q8_0, and q8_0_q8_0 quantization #5780
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 1 commit
Commits
Show all changes
28 commits
Select commit
Hold shift + click to select a range
002e36e
Arm AArch64: optimized GEMV and GEMM kernels for q4_0_q8_0, and q8_0_…
Dibakar 340ef07
Arm AArch64: add optimized GEMV and GEMM asm kernels for q4_0_q8_0 qu…
Dibakar 81215ff
Arm AArch64: add optimized GEMV and GEMM asm kernels for q4_0_q8_0 qu…
Dibakar 6c8d826
Arm AArch64: add optimized GEMV and GEMM asm kernels for q4_0_q8_0 qu…
Dibakar 43e1297
Arm AArch64: add optimized GEMV and GEMM asm kernels for q4_0_q8_0 qu…
Dibakar 441ab64
Arm AArch64: add copyright claim only to ggml-aarch64.cpp and ggml-aa…
Dibakar 8ee6779
Arm AArch64: minor code refactoring for rebase
Dibakar a657246
Arm AArch64: minor code refactoring for resolving a build issue with …
Dibakar 746b57f
Arm AArch64: minor code refactoring to split the Q4_0_AARC64 type int…
Dibakar 5d10c21
Arm AArch64: minor code change for resolving a build issue with serve…
Dibakar 7ac03e5
retrigger checks
Dibakar e2c1c47
Arm AArch64: minor code changes for rebase
Dibakar 79b6cdf
Arm AArch64: minor changes to skip the pr#7433 vec_dot code for arm c…
Dibakar 3c1ad5f
Arm AArch64: remove stale LLAMA_QKK_64 from CMakeLists.txt and delete…
Dibakar a7055b7
Arm AArch64: add reference scalar gemm and gemv, and avoid dynamic me…
Dibakar cce236b
Arm AArch64: add multithreaded quantization support for the new types…
Dibakar 7a70606
Arm AArch64: minor code refactoring
Dibakar ffbfabb
Arm AArch64: simplify logic for calling gemm and gemv functions in gg…
Dibakar cbbfd69
Arm AArch64: minimize changes in ggml_compute_forward_mul_mat
Dibakar 3564644
Arm AArch64: minor code refactoring, and add reference scalar code to…
Dibakar 110d143
Arm AArch64: minor code refactoring
Dibakar 4ff0b22
Arm AArch64: minor code refactoring
Dibakar 42724b4
Arm AArch64: minor code refactoring
Dibakar e5f4713
rebase on the latest master commit 3fd62a6 and adapt to the new direc…
Dibakar c2595d0
Arm AArch64: remove a redundant comment
Dibakar a7abb78
Arm AArch64: add pragma in ggml-aarch64.c to turn -Woverlength-string…
Dibakar 0e84ef1
Arm AArch64: use __aarch64__ check to guard 64-bit neon kernels
Dibakar c653eb1
Arm AArch64: update docs/build.md README to include compile time flag…
Dibakar File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Arm AArch64: remove stale LLAMA_QKK_64 from CMakeLists.txt and delete…
… build.zig
- Loading branch information
commit 3c1ad5fe3c673dca23f750f746e5bfcf7ff516f2
There are no files selected for viewing
This file was deleted.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.