llama_cpp_for_radxa_dragon_wing_q6a

History

Xuan-Son Nguyen e974923698 docs: listing qwen3-asr and qwen3-omni as supported (#21857 ) * docs: listing qwen3-asr and qwen3-omni as supported * nits		2026-04-13 22:28:17 +02:00
..
batched-bench
cli	server: save and clear idle slots on new task (`--clear-idle`) (#20993 )	2026-04-03 19:02:27 +02:00
completion	server: save and clear idle slots on new task (`--clear-idle`) (#20993 )	2026-04-03 19:02:27 +02:00
cvector-generator
export-lora
fit-params
gguf-split
imatrix
llama-bench	common : add callback interface for download progress (#21735 )	2026-04-10 22:17:00 +02:00
mtmd	docs: listing qwen3-asr and qwen3-omni as supported (#21857 )	2026-04-13 22:28:17 +02:00
parser	common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230 )	2026-04-03 17:51:52 +02:00
perplexity	ggml: backend-agnostic tensor parallelism (experimental) (#19378 )	2026-04-09 16:42:19 +02:00
quantize	ggml: add Q1_0 1-bit quantization support (CPU) (#21273 )	2026-04-06 20:55:21 +02:00
results
rpc
server	server: Expose build_info in router mode (#21835 )	2026-04-13 11:14:42 +02:00
tokenize
tts
CMakeLists.txt