llama_cpp_for_radxa_dragon_wing_q6a

George e9a859db3c ggml: added cleanups in ggml_quantize_free (#19278) Add missing cleanup calls for IQ2_S, IQ1_M quantization types and IQ3XS with 512 blocks during quantization cleanup.		2026-02-03 08:43:39 +02:00
cmake
include	ggml-cpu: FA split across kv for faster TG (#19209)	2026-02-03 01:19:55 +08:00
src	ggml: added cleanups in ggml_quantize_free (#19278)	2026-02-03 08:43:39 +02:00
.gitignore
CMakeLists.txt	Bump cmake max version (needed for Windows on Snapdragon builds) (#19188)	2026-02-01 14:13:38 -08:00