Releases: ggml-org/llama.cpp
Releases · ggml-org/llama.cpp
b5522
b5519
CUDA: fix FA tg at long context for CC >= 8.9 (#13852)
b5517
CANN: Add SOC TYPE printing in cmake configuration (#13837)
b5516
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, …
b5515
opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors …
b5514
vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817) Also change it to be controlled by an env var rather than cmake flag
b5513
cmake : add llama-cparams.cpp to build (#13832)
b5512
SYCL: add gelu_erf kernel (#13749) * SYCL: add gelu_erf kernel * refactor code Co-authored-by: Atharva Dubey <atharva.dubey@codeplay.com> * Use scope_op_debug_print --------- Co-authored-by: Atharva Dubey <atharva.dubey@codeplay.com>
b5510
ggml : add ggml_repeat_4d (#13824)
b5509
ggml : riscv: add xtheadvector support (#13720) * ggml : riscv: add xtheadvector support * ggml : clean up some macro usage