llama_cpp_for_radxa_dragon_.../ggml
George e9a859db3c
ggml: added cleanups in ggml_quantize_free (#19278)
Add missing cleanup calls for IQ2_S, IQ1_M quantization types and IQ3XS with 512 blocks during quantization cleanup.
2026-02-03 08:43:39 +02:00
cmake
include         ggml-cpu: FA split across kv for faster TG (#19209)  2026-02-03 01:19:55 +08:00
src             ggml: added cleanups in ggml_quantize_free (#19278)  2026-02-03 08:43:39 +02:00
.gitignore
CMakeLists.txt  Bump cmake max version (needed for Windows on Snapdragon builds) (#19188)  2026-02-01 14:13:38 -08:00