llama_cpp_for_radxa_dragon_wing_q6a

History

Molly Sophia 4b0c638b9a common : disable KV cache shifting automatically for unsupported models (#11053 ) * Disable KV cache shifting automatically for unsupported models instead of exiting directly Signed-off-by: Molly Sophia <mollysophia379@gmail.com> * Update common/common.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> --------- Signed-off-by: Molly Sophia <mollysophia379@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>		2025-01-03 14:13:18 +02:00
..
cmake
arg.cpp	llama : refactor `src/llama.cpp` (#10902 )	2025-01-03 10:18:53 +02:00
arg.h
base64.hpp
build-info.cpp.in
CMakeLists.txt	Opt class for positional argument handling (#10508 )	2024-12-13 19:34:25 +01:00
common.cpp	common : disable KV cache shifting automatically for unsupported models (#11053 )	2025-01-03 14:13:18 +02:00
common.h	llama : refactor `src/llama.cpp` (#10902 )	2025-01-03 10:18:53 +02:00
console.cpp
console.h
json-schema-to-grammar.cpp
json-schema-to-grammar.h
json.hpp
log.cpp
log.h
ngram-cache.cpp
ngram-cache.h
sampling.cpp	sampling : refactor + optimize penalties sampler (#10803 )	2024-12-16 12:31:14 +02:00
sampling.h	speculative : refactor and add a simpler example (#10362 )	2024-11-25 09:58:41 +02:00
speculative.cpp	server : fix free of spec context and batch (#10651 )	2024-12-07 11:52:44 +02:00
speculative.h	speculative : refactor and add a simpler example (#10362 )	2024-11-25 09:58:41 +02:00
stb_image.h