`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3 w/ enable_thinking:false) by ochafik · Pull Request #13771 · ggml-org/llama.cpp · GitHub
Merged
ochafik merged 13 commits into ggml-org:master from ochafik:enable-thinking on May 25, 2025

Commits

Commits on May 25, 2025
