llama_cpp_for_radxa_dragon_.../tools
Xuan-Son Nguyen e974923698
docs: listing qwen3-asr and qwen3-omni as supported (#21857)
* nits
2026-04-13 22:28:17 +02:00
Name | Last commit | Date
batched-bench
cli | server: save and clear idle slots on new task (--clear-idle) (#20993) | 2026-04-03 19:02:27 +02:00
completion | server: save and clear idle slots on new task (--clear-idle) (#20993) | 2026-04-03 19:02:27 +02:00
cvector-generator
export-lora
fit-params
gguf-split
imatrix
llama-bench | common : add callback interface for download progress (#21735) | 2026-04-10 22:17:00 +02:00
mtmd | docs: listing qwen3-asr and qwen3-omni as supported (#21857) | 2026-04-13 22:28:17 +02:00
parser | common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230) | 2026-04-03 17:51:52 +02:00
perplexity | ggml: backend-agnostic tensor parallelism (experimental) (#19378) | 2026-04-09 16:42:19 +02:00
quantize | ggml: add Q1_0 1-bit quantization support (CPU) (#21273) | 2026-04-06 20:55:21 +02:00
results
rpc
server | server: Expose build_info in router mode (#21835) | 2026-04-13 11:14:42 +02:00
tokenize
tts
CMakeLists.txt