llama_cpp_for_radxa_dragon_.../examples
Pierrick Hymbert 1ecea255eb
server: health: fix race condition on slots data using tasks queue (#5634)
* server: health: fix race condition on slots data using tasks queue

* server: health:
    * include_slots only if slots_endpoint
    * fix compile warning task.target_id not initialized.
2024-02-21 15:47:48 +01:00
..
baby-llama baby-llama : allocate graphs in ggml_context (#5573) 2024-02-19 10:25:38 +02:00
batched ggml, common, examples, tests : fixed type arguments in printf (#5528) 2024-02-18 18:20:12 +02:00
batched-bench ggml, common, examples, tests : fixed type arguments in printf (#5528) 2024-02-18 18:20:12 +02:00
batched.swift ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
beam-search ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
benchmark
convert-llama2c-to-ggml ggml, common, examples, tests : fixed type arguments in printf (#5528) 2024-02-18 18:20:12 +02:00
embedding ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
export-lora ci : add an option to fail on compile warning (#3952) 2024-02-17 23:03:14 +02:00
finetune finetune : rename feed-forward tensors (w1/w2/w3) (#4839) 2024-02-13 15:15:42 +02:00
gguf
imatrix ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
infill ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
jeopardy
llama-bench ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
llama.android ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
llama.swiftui ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
llava llava : add --skip-unknown to 1.6 convert.py (#5632) 2024-02-21 15:36:57 +02:00
lookahead ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
lookup ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
main ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
main-cmake-pkg
parallel ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
passkey ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
perplexity ci : fix wikitext url + compile warnings (#5569) 2024-02-18 22:39:30 +02:00
quantize IQ4_NL: 4-bit non-linear quants with blocks of 32 (#5590) 2024-02-21 11:39:52 +02:00
quantize-stats refactor : switch to emplace_back to avoid extra object (#5291) 2024-02-03 13:23:37 +02:00
save-load-state
server server: health: fix race condition on slots data using tasks queue (#5634) 2024-02-21 15:47:48 +01:00
simple ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
speculative ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
sycl [SYCL] update guide of SYCL backend (#5254) 2024-02-02 15:53:27 +08:00
tokenize ggml : add numa options (#5377) 2024-02-16 11:31:07 +02:00
train-text-from-scratch ggml, common, examples, tests : fixed type arguments in printf (#5528) 2024-02-18 18:20:12 +02:00
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt gguf : add python reader example (#5216) 2024-02-13 19:56:38 +02:00
gpt4all.sh
json-schema-to-grammar.py examples : support minItems/maxItems in JSON grammar converter (#5039) 2024-02-19 16:14:07 +02:00
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
pydantic-models-to-grammar-examples.py
pydantic_models_to_grammar.py
reason-act.sh
server-llama2-13B.sh