llama_cpp_for_radxa_dragon_.../tools
2026-03-27 10:01:13 +02:00
..
batched-bench
cli
completion completion : Fix segfault on model load failure (#21049) 2026-03-27 10:01:13 +02:00
cvector-generator
export-lora
fit-params
gguf-split gguf-split : clarify operation of gguf-split (#19749) 2026-03-25 13:12:50 +02:00
imatrix imatrix : fix crash when using --show-statistics with zero counts (#19532) 2026-03-26 08:14:36 +01:00
llama-bench llama-bench: print -n-cpu-moe when offloaded layers > 1 (#20984) 2026-03-25 21:17:27 +08:00
mtmd mtmd: refactor image preprocessing (#21031) 2026-03-26 19:49:20 +01:00
parser
perplexity
quantize
results
rpc
server Send reasoning content back to the model across turns via the reasoning_content API field (#21036) 2026-03-27 08:17:35 +01:00
tokenize
tts
CMakeLists.txt