llama_cpp_for_radxa_dragon_.../examples
ardfork 978ba3d83d
Server: Don't ignore llama.cpp params (#8754)
* Don't ignore llama.cpp params

* Add fallback for max_tokens
2024-08-04 20:16:23 +02:00
..
baby-llama baby-llama : remove duplicate vector include 2024-08-04 13:24:59 +03:00
batched
batched-bench batched-bench : handle empty -npl (#8839) 2024-08-04 13:55:03 +03:00
batched.swift
benchmark
convert-llama2c-to-ggml
cvector-generator
deprecation-warning examples : remove finetune and train-text-from-scratch (#8669) 2024-07-25 10:39:04 +02:00
embedding
eval-callback ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
export-lora examples : export-lora : fix issue with quantized base models (#8687) 2024-07-25 23:49:39 +02:00
gbnf-validator
gguf
gguf-hash
gguf-split
gritlm
imatrix ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
infill
jeopardy
llama-bench ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
llama.android
llama.swiftui
llava ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
lookahead
lookup
main llama : fix llama_chat_format_single for mistral (#8657) 2024-07-24 13:48:46 +02:00
main-cmake-pkg
parallel
passkey
perplexity
quantize
quantize-stats
retrieval
rpc
save-load-state llama : refactor session file management (#8699) 2024-07-28 00:42:05 -04:00
server Server: Don't ignore llama.cpp params (#8754) 2024-08-04 20:16:23 +02:00
simple
speculative
sycl
tokenize ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt examples : remove finetune and train-text-from-scratch (#8669) 2024-07-25 10:39:04 +02:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
llm.vim
Miku.sh
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh