8000 Tags · ggml-org/llama.cpp · GitHub
[go: up one dir, main page]

Skip to content

Tags: ggml-org/llama.cpp

Tags

b5517

Toggle b5517's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CANN: Add SOC TYPE printing in cmake configuration (#13837)

b5516

Toggle b5516's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, …

…`group_norm` (#13787)

* opencl: add `argsort`

* opencl: add `div`

* opencl: add `add_rows`

* opencl: add `sub`

* opencl: add `sigmoid`, both `f16` and `f32`

* opencl: add `group_norm`

b5515

Toggle b5515's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (#…

…13790)

b5514

Toggle b5514's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817)

Also change it to be controlled by an env var rather than cmake flag

b5513

Toggle b5513's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cmake : add llama-cparams.cpp to build (#13832)

b5512

Toggle b5512's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
SYCL: add gelu_erf kernel (#13749)

* SYCL: add gelu_erf kernel

* refactor code

Co-authored-by: Atharva Dubey <atharva.dubey@codeplay.com>

* Use scope_op_debug_print

---------

Co-authored-by: Atharva Dubey <atharva.dubey@codeplay.com>

b5510

Toggle b5510's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml : add ggml_repeat_4d (#13824)

b5509

Toggle b5509's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml : riscv: add xtheadvector support (#13720)

* ggml : riscv: add xtheadvector support

* ggml : clean up some macro usage

b5508

Toggle b5508's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#…

…13784)

* mtmd : allow multiple modalities at the same time

* refactor mtmd tokenizer

* fix compile

* ok, missing SinusoidsPositionEmbedding

* first working version

* fix style

* more strict validate of n_embd

* refactor if..else to switch

* fix regression

* add test for 3B

* update docs

* fix tokenizing with add_special

* add more tests

* fix test case "huge"

* rm redundant code

* set_position_mrope_1d rm n_tokens

b5506

Toggle b5506's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml-cpu: x86 feature detection is specific to x86 (#13811)

0