llama_cpp_for_radxa_dragon_wing_q6a

Vishal Singh f1ac84119c ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)		2026-04-03 12:19:08 +03:00

* ggml-zendnn : add MUL_MAT_ID op support for MoE models
  - Add MUL_MAT_ID op acceleration for Mixture-of-Experts models
  - MUL_MAT_ID op fallback to CPU backend if total experts > 32
  - Point ZenDNN lib to latest bits ZenDNN-2026-WW13
* ggml-zendnn : add braces to sgemm failure condition for consistency

Co-authored-by: Aaron Teo <taronaeo@gmail.com>

---------

Co-authored-by: Aaron Teo <taronaeo@gmail.com>
android
backend	ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)	2026-04-03 12:19:08 +03:00
development	Autoparser - complete refactoring of parser architecture (#18675)	2026-03-06 21:01:00 +01:00
multimodal	chore : correct typos [no ci] (#20041)	2026-03-05 08:50:21 +01:00
ops	ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)	2026-04-03 12:19:08 +03:00
android.md
autoparser.md	common/parser: add proper reasoning tag prefill reading (#20424)	2026-03-19 16:58:21 +01:00
build-riscv64-spacemit.md
build-s390x.md
build.md	Update Dawn version in WebGPU CI (#20784)	2026-04-01 09:53:05 -07:00
docker.md	CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122)	2026-03-30 20:24:37 +02:00
function-calling.md
install.md
llguidance.md
multimodal.md	mtmd: Add DeepSeekOCR Support (#17400)	2026-03-25 19:57:40 +01:00
ops.md	ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)	2026-04-03 12:19:08 +03:00
preset.md
speculative.md