Releases: ochafik/llama.cpp
b5546
b5537
llama : add support for jina-reranker-v2 (#13900)
b5500
scripts : add option to compare commits in Debug (#13806)
* scripts : add option to compare commits in Debug
* cont : reuse existing CMAKE_OPTS
b5497
server: fix streaming crashes (#13786)
* add preludes to content on partial regex match
* allow all parsers to parse non-tool-call content
* tweak order of <|python_tag|> vs <function= parsing for functionary v3.1 format; still not ideal, but hopefully less prone to crashing
b5495
`server`: fix format of streamed tool call deltas (diff name, fix id …
b5494
server: fix regression on streamed non-chat completion w/ stops (#13785)
* more forgiving message diffs: partial stop words aren't erased, full stops are
* add (slow) server test for completion + stream + stop
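The stop-word behavior described in this entry can be illustrated with a small, generic sketch. It is not the code from #13785. The names `split_on_stop` and `stream` are hypothetical, and the hold-back strategy is an assumption about how a streaming client or server might treat stops: text before a full stop string is emitted and the stop itself is dropped, while a trailing partial match is only held back and is emitted later if it turns out not to be a stop.

```python
def split_on_stop(text, stops):
    """Return (emit, done): the prefix of `text` that is safe to emit,
    and whether a full stop string was found."""
    # Full stop: cut at the earliest occurrence of any stop string.
    hits = [i for i in (text.find(s) for s in stops) if i != -1]
    if hits:
        return text[:min(hits)], True
    # Partial stop: hold back the longest suffix of `text` that is a
    # proper prefix of some stop string (it may still complete later).
    hold = 0
    for s in stops:
        for n in range(min(len(s) - 1, len(text)), 0, -1):
            if text.endswith(s[:n]):
                hold = max(hold, n)
                break
    return text[:len(text) - hold], False


def stream(chunks, stops):
    """Yield text deltas for incoming chunks, honoring stop strings."""
    buf = ""   # everything generated so far
    sent = 0   # number of characters already yielded
    for chunk in chunks:
        buf += chunk
        emit, done = split_on_stop(buf, stops)
        if len(emit) > sent:
            yield emit[sent:]
            sent = len(emit)
        if done:
            return  # full stop reached: the stop string is never emitted
    # Generation ended without a full stop: flush any held-back partial match.
    if sent < len(buf):
        yield buf[sent:]
```

For example, `"".join(stream(["Hello ", "wor", "ld</", "s> extra"], ["</s>"]))` produces `Hello world`, while `"".join(stream(["a<", "/b"], ["</s>"]))` produces `a</b`, because the held-back `<` is released once it can no longer be part of a stop.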
b5493
examples : allow extracting embeddings from decoder contexts (#13797) ggml-ci
b5488
`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3…
b5479
server: fix/test add_generation_prompt
b5478
`server`: streaming of tool calls and thoughts when `--jinja` is on (…