llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Steve Grubb 988631335a server : free llama_batch on exit (#7212 ) * [server] Cleanup a memory leak on exit There are a couple memory leaks on exit of the server. This hides others. After cleaning this up, you can see leaks on slots. But that is another patch to be sent after this. * make tab into spaces		2024-05-11 11:13:02 +03:00
..
baby-llama
batched
batched-bench	ggml : add Flash Attention (#5021 )	2024-04-30 12:16:08 +03:00
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml	TypoFix (#7162 )	2024-05-09 10:16:45 +02:00
embedding	llama : add Jina Embeddings architecture (#6826 )	2024-05-11 10:46:09 +03:00
eval-callback	eval-callback : fix conversion to float (#7184 )	2024-05-10 01:04:12 +02:00
export-lora
finetune	ggml : introduce bfloat16 support (#6412 )	2024-05-08 09:30:09 +03:00
gbnf-validator
gguf
gguf-split	gguf-split: add --no-tensor-first-split (#7072 )	2024-05-04 18:56:22 +02:00
gritlm
imatrix	Fixed save_imatrix to match old behaviour for MoE (#7099 )	2024-05-08 02:24:16 +02:00
infill
jeopardy
llama-bench	llama-bench : add pp+tg test type (#7199 )	2024-05-10 18:03:54 +02:00
llama.android
llama.swiftui
llava	Fix memory bug in grammar parser (#7194 )	2024-05-10 21:01:08 +10:00
lookahead
lookup
main	Fix memory bug in grammar parser (#7194 )	2024-05-10 21:01:08 +10:00
main-cmake-pkg	build(cmake): simplify instructions (`cmake -B build && cmake --build build ...`) (#6964 )	2024-04-29 17:02:45 +01:00
parallel
passkey
perplexity	perplexity: more statistics, added documentation (#6936 )	2024-04-30 23:36:27 +02:00
quantize	ggml : introduce bfloat16 support (#6412 )	2024-05-08 09:30:09 +03:00
quantize-stats	Improve usability of --model-url & related flags (#6930 )	2024-04-30 00:52:50 +01:00
retrieval
save-load-state
server	server : free llama_batch on exit (#7212 )	2024-05-11 11:13:02 +03:00
simple
speculative
sycl	docs: fix typos (#7124 )	2024-05-07 18:20:33 +03:00
tokenize
train-text-from-scratch
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gpt4all.sh
json-schema-pydantic-example.py
json_schema_to_grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
pydantic-models-to-grammar-examples.py
pydantic_models_to_grammar.py
reason-act.sh
regex-to-grammar.py
server-embd.py
server-llama2-13B.sh
ts-type-to-grammar.sh