llama_cpp_for_radxa_dragon_.../common
Molly Sophia 4b0c638b9a
common : disable KV cache shifting automatically for unsupported models (#11053)
* Disable KV cache shifting automatically for unsupported models

instead of exiting directly

Signed-off-by: Molly Sophia <mollysophia379@gmail.com>

* Update common/common.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Signed-off-by: Molly Sophia <mollysophia379@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-01-03 14:13:18 +02:00
..
cmake
arg.cpp llama : refactor src/llama.cpp (#10902) 2025-01-03 10:18:53 +02:00
arg.h
base64.hpp
build-info.cpp.in
CMakeLists.txt Opt class for positional argument handling (#10508) 2024-12-13 19:34:25 +01:00
common.cpp common : disable KV cache shifting automatically for unsupported models (#11053) 2025-01-03 14:13:18 +02:00
common.h llama : refactor src/llama.cpp (#10902) 2025-01-03 10:18:53 +02:00
console.cpp
console.h
json-schema-to-grammar.cpp
json-schema-to-grammar.h
json.hpp
log.cpp
log.h
ngram-cache.cpp
ngram-cache.h
sampling.cpp sampling : refactor + optimize penalties sampler (#10803) 2024-12-16 12:31:14 +02:00
sampling.h speculative : refactor and add a simpler example (#10362) 2024-11-25 09:58:41 +02:00
speculative.cpp server : fix free of spec context and batch (#10651) 2024-12-07 11:52:44 +02:00
speculative.h speculative : refactor and add a simpler example (#10362) 2024-11-25 09:58:41 +02:00
stb_image.h