llama_cpp_for_radxa_dragon_.../common
Johannes Gäßler 026d2ad472
llama: fix magic number of 999 for GPU layers (#18266)
* llama: fix magic number of 999 for GPU layers

* use strings for -ngl, -ngld

* enacapsulate n_gpu_layers, split_mode
2025-12-27 20:18:35 +01:00
..
arg.cpp llama: fix magic number of 999 for GPU layers (#18266) 2025-12-27 20:18:35 +01:00
arg.h server: (router) add stop-timeout option (#18350) 2025-12-24 23:47:49 +01:00
base64.hpp
build-info.cpp.in
chat-parser-xml-toolcall.cpp
chat-parser-xml-toolcall.h
chat-parser.cpp
chat-parser.h
chat-peg-parser.cpp
chat-peg-parser.h
chat.cpp
chat.h
CMakeLists.txt common : reorganize includes to prioritize vendored deps (#18222) 2025-12-20 21:43:21 -06:00
common.cpp llama: fix magic number of 999 for GPU layers (#18266) 2025-12-27 20:18:35 +01:00
common.h llama: fix magic number of 999 for GPU layers (#18266) 2025-12-27 20:18:35 +01:00
console.cpp
console.h
download.cpp
download.h
http.h
json-partial.cpp
json-partial.h
json-schema-to-grammar.cpp
json-schema-to-grammar.h
llguidance.cpp
log.cpp
log.h
ngram-cache.cpp
ngram-cache.h
peg-parser.cpp
peg-parser.h
preset.cpp server: support load model on startup, support preset-only options (#18206) 2025-12-20 09:25:27 +01:00
preset.h presets: refactor, allow cascade presets from different sources, add global section (#18169) 2025-12-19 12:08:20 +01:00
regex-partial.cpp
regex-partial.h
sampling.cpp
sampling.h
speculative.cpp
speculative.h
unicode.cpp
unicode.h