8000 Tags · AD2605/llama.cpp · GitHub
[go: up one dir, main page]

Skip to content

Tags: AD2605/llama.cpp

Tags

b5854

Toggle b5854's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml : prevent integer overflow in gguf tensor size calculation (ggml…

…-org#14595)

b5795

Toggle b5795's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 (ggml-org#1…

…4411)

* [CANN]update to aclnnGroupedMatmulV2

Signed-off-by: noemotiovon <757486878@qq.com>

* Support MUL_MAT_ID on 310p

Signed-off-by: noemotiovon <757486878@qq.com>

* fix editorconfig

Signed-off-by: noemotiovon <757486878@qq.com>

---------

Signed-off-by: noemotiovon <757486878@qq.com>

b5787

Toggle b5787's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add Conv2d for CPU (ggml-org#14388)

* Conv2D: Add CPU version

* Half decent

* Tiled approach for F32

* remove file

* Fix tests

* Support F16 operations

* add assert about size

* Review: further formatting fixes, add assert and use CPU version of fp32->fp16

b5753

Toggle b5753's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
opencl: ref count `ggml_backend_opencl_context` and refactor profiling (

ggml-org#14254)

* Move profiling info into `ggml_backend_opencl_context`
* Add `enqueue_ndrange_kernel` to launch kernel

b5716

Toggle b5716's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml : fix repack work size for mul_mat_id (ggml-org#14292)

ggml-ci

b5688

Toggle b5688's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml-cpu : remove the weak alias trick (ggml-org#14221)

b5611

Toggle b5611's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
webui: fix sidebar being covered by main content (ggml-org#14082)

* webui: fix sidebar being covered by main content

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* webui: update index.html.gz

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

b5518

Toggle b5518's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
convert : fix tensor naming conflict for llama 4 vision (ggml-org#13836)

* convert : fix tensor naming conflict for llama 4 vision

* add comment

b5503

Toggle b5503's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
sampling : make sure samplers return at least 1 token (ggml-org#13822)

* sampling : min-p should always return at least one token

ggml-ci

* sampling : same for typical sampling

* tests : sampling tests use min_keep == 0

ggml-ci

b5467

Toggle b5467's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
llama : allow custom list of swa_layers (ggml-org#13726)

0