llama_cpp_for_radxa_dragon_.../tools
2026-01-10 17:51:56 +02:00
..
batched-bench
cli
completion
cvector-generator
export-lora
fit-params llama-fit-params: free memory target per device (#18679) 2026-01-08 10:07:58 +01:00
gguf-split
imatrix
llama-bench
mtmd mtmd: Add Gemma3n multimodal support with MobileNetV5 vision encoder (#18256) 2026-01-09 23:42:38 +01:00
perplexity
quantize quantize: prevent input/output file collision (#18451) 2025-12-31 23:29:03 +08:00
rpc
server server : adjust unified KV cache tests (#18716) 2026-01-10 17:51:56 +02:00
tokenize
tts
CMakeLists.txt cmake: only build cli when server is enabled (#18670) 2026-01-09 16:43:26 +01:00