llama_cpp_for_radxa_dragon_wing_q6a


rehan-10xengineer 1e796eb41f ggml-cpu: add 128-bit RVV implementation for Quantization Vector Dot (#20633)	2026-04-16 11:15:15 +03:00

* ggml-cpu: add 128-bit impls for i-quants, ternary quants
* ggml-cpu: add 128-bit impls for iq2_xs, iq3_s, iq3_xxs, tq2_0

Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

* ggml-cpu: refactor; add rvv checks

---------

Co-authored-by: taimur-10x <taimur.ahmad@10xengineers.ai>
Co-authored-by: Rehan Qasim <rehan.qasim@10xengineers.ai>

cmake	ggml: backend-agnostic tensor parallelism (experimental) (#19378)	2026-04-09 16:42:19 +02:00
include	CUDA: manage NCCL communicators in context (#21891)	2026-04-15 15:58:40 +02:00
src	ggml-cpu: add 128-bit RVV implementation for Quantization Vector Dot (#20633)	2026-04-16 11:15:15 +03:00
.gitignore
CMakeLists.txt	[SYCL] Fix Q8_0 reorder: garbage on 2nd prompt + crash on full VRAM (#21638)	2026-04-16 08:34:05 +03:00