Tags · AD2605/llama.cpp

b5854

ggml : prevent integer overflow in gguf tensor size calculation (ggml…

…-org#14595)

Jul 9, 2025
26a48ad
zip
tar.gz
Downloads

b5795

CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 (ggml-org#1…

…4411)

* [CANN]update to aclnnGroupedMatmulV2

Signed-off-by: noemotiovon <757486878@qq.com>

* Support MUL_MAT_ID on 310p

Signed-off-by: noemotiovon <757486878@qq.com>

* fix editorconfig

Signed-off-by: noemotiovon <757486878@qq.com>

---------

Signed-off-by: noemotiovon <757486878@qq.com>

Jul 1, 2025
343b6e9
zip
tar.gz
Downloads

b5787

Add Conv2d for CPU (ggml-org#14388)

* Conv2D: Add CPU version

* Half decent

* Tiled approach for F32

* remove file

* Fix tests

* Support F16 operations

* add assert about size

* Review: further formatting fixes, add assert and use CPU version of fp32->fp16

Jun 30, 2025
0a5a3b5
zip
tar.gz
Downloads

b5753

opencl: ref count `ggml_backend_opencl_context` and refactor profiling (

ggml-org#14254)

* Move profiling info into `ggml_backend_opencl_context`
* Add `enqueue_ndrange_kernel` to launch kernel

Jun 24, 2025
73e53dc
zip
tar.gz
Downloads

b5716

ggml : fix repack work size for mul_mat_id (ggml-org#14292)

ggml-ci

Jun 20, 2025
d27b3ca
zip
tar.gz
Downloads

b5688

ggml-cpu : remove the weak alias trick (ggml-org#14221)

Jun 17, 2025
860a9e4
zip
tar.gz
Downloads

b5611

webui: fix sidebar being covered by main content (ggml-org#14082)

* webui: fix sidebar being covered by main content

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* webui: update index.html.gz

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

Jun 9, 2025
dc0623f
zip
tar.gz
Downloads

b5518

convert : fix tensor naming conflict for llama 4 vision (ggml-org#13836)

* convert : fix tensor naming conflict for llama 4 vision

* add comment

May 28, 2025
26b79b6
zip
tar.gz
Downloads

b5503

sampling : make sure samplers return at least 1 token (ggml-org#13822)

* sampling : min-p should always return at least one token

ggml-ci

* sampling : same for typical sampling

* tests : sampling tests use min_keep == 0

ggml-ci

May 27, 2025
f9cd683
zip
tar.gz
Downloads

b5467

llama : allow custom list of swa_layers (ggml-org#13726)

May 23, 2025
8a2afb7
zip
tar.gz
Downloads

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b5854

b5795

b5787

b5753

b5716

b5688

b5611

b5518

b5503

b5467

Tags: AD2605/llama.cpp