llama_cpp_for_radxa_dragon_.../docs
Piotr Wilkin (ilintar) 5e54d51b19
common/parser: add proper reasoning tag prefill reading (#20424)
* Implement proper prefill extraction

* Refactor cli parameters, update docs, move reasoning budget sampler part to common/reasoning-budget.cpp

* Update tools/server/server-task.cpp

* refactor: move grammars to variant, remove grammar_external, handle exception internally

* Make code less C++y

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2026-03-19 16:58:21 +01:00
..
android
backend
development
multimodal
ops ggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE, SSM_CONV, GATED_DELTA_NET) + GET_ROWS optimization (#20687) 2026-03-19 08:45:28 -07:00
android.md
autoparser.md common/parser: add proper reasoning tag prefill reading (#20424) 2026-03-19 16:58:21 +01:00
build-riscv64-spacemit.md
build-s390x.md
build.md
docker.md docs: add information about openvino in the docker page (#20743) 2026-03-19 15:08:47 +08:00
function-calling.md
install.md
llguidance.md
multimodal.md
ops.md ggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE, SSM_CONV, GATED_DELTA_NET) + GET_ROWS optimization (#20687) 2026-03-19 08:45:28 -07:00
preset.md
speculative.md