llama_cpp_for_radxa_dragon_.../examples
Steve Grubb 988631335a
server : free llama_batch on exit (#7212)
* [server] Cleanup a memory leak on exit

There are a couple memory leaks on exit of the server. This hides others.
After cleaning this up, you can see leaks on slots. But that is another
patch to be sent after this.

* make tab into spaces
2024-05-11 11:13:02 +03:00
..
baby-llama
batched
batched-bench ggml : add Flash Attention (#5021) 2024-04-30 12:16:08 +03:00
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml TypoFix (#7162) 2024-05-09 10:16:45 +02:00
embedding llama : add Jina Embeddings architecture (#6826) 2024-05-11 10:46:09 +03:00
eval-callback eval-callback : fix conversion to float (#7184) 2024-05-10 01:04:12 +02:00
export-lora
finetune ggml : introduce bfloat16 support (#6412) 2024-05-08 09:30:09 +03:00
gbnf-validator
gguf
gguf-split gguf-split: add --no-tensor-first-split (#7072) 2024-05-04 18:56:22 +02:00
gritlm
imatrix Fixed save_imatrix to match old behaviour for MoE (#7099) 2024-05-08 02:24:16 +02:00
infill
jeopardy
llama-bench llama-bench : add pp+tg test type (#7199) 2024-05-10 18:03:54 +02:00
llama.android
llama.swiftui
llava Fix memory bug in grammar parser (#7194) 2024-05-10 21:01:08 +10:00
lookahead
lookup
main Fix memory bug in grammar parser (#7194) 2024-05-10 21:01:08 +10:00
main-cmake-pkg build(cmake): simplify instructions (cmake -B build && cmake --build build ...) (#6964) 2024-04-29 17:02:45 +01:00
parallel
passkey
perplexity perplexity: more statistics, added documentation (#6936) 2024-04-30 23:36:27 +02:00
quantize ggml : introduce bfloat16 support (#6412) 2024-05-08 09:30:09 +03:00
quantize-stats Improve usability of --model-url & related flags (#6930) 2024-04-30 00:52:50 +01:00
retrieval
save-load-state
server server : free llama_batch on exit (#7212) 2024-05-11 11:13:02 +03:00
simple
speculative
sycl docs: fix typos (#7124) 2024-05-07 18:20:33 +03:00
tokenize
train-text-from-scratch
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gpt4all.sh
json-schema-pydantic-example.py
json_schema_to_grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
pydantic-models-to-grammar-examples.py
pydantic_models_to_grammar.py
reason-act.sh
regex-to-grammar.py
server-embd.py
server-llama2-13B.sh
ts-type-to-grammar.sh