llama_cpp_for_radxa_dragon_.../tools
2026-02-25 15:15:42 +02:00
..
batched-bench
cli cli : provide model with text filename (#19783) 2026-02-22 22:33:49 +01:00
completion llama : remove write/read of output ids/logits/embeddings (#18862) 2026-02-23 07:04:30 +01:00
cvector-generator docs : Minor cleanups (#19252) 2026-02-02 08:38:55 +02:00
export-lora docs : Minor cleanups (#19252) 2026-02-02 08:38:55 +02:00
fit-params llama-fit-params: keep explicit --ctx-size 0 (#19070) 2026-01-24 22:13:08 +01:00
gguf-split
imatrix
llama-bench
mtmd model: Add PaddleOCR-VL model support (#18825) 2026-02-19 17:05:25 +01:00
perplexity perplexity: add proper batching (#19661) 2026-02-16 18:44:44 +02:00
quantize quantize : add --dry-run option (#19526) 2026-02-20 09:20:16 +01:00
rpc NetBSD build support (#19589) 2026-02-14 09:47:01 +01:00
server server : enable multi-modal prompt caching (#19877) 2026-02-25 15:15:42 +02:00
tokenize
tts model : fix wavtokenizer embedding notions (#19479) 2026-02-11 07:52:20 +02:00
CMakeLists.txt