llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Georgi Gerganov d0cee0d36d gguf : add 64-bit support (GGUF v2) (#2821 ) * gguf : bump version to 2 * gguf : add support for 64-bit (no backwards comp yet) * gguf : v1 backwards comp * gguf.py : bump GGUF version * gguf.py : uint64_t on all lengths, sizes and counts, enums still uint32_t * gguf.py : string lengths uint32_t * gguf : update all counts to 64-bit * gguf.py : string len uint64_t and n_dims uint32_t * gguf : fix typo * llama.cpp : print gguf version --------- Co-authored-by: klosax <131523366+klosax@users.noreply.github.com>		2023-08-27 14:19:54 +03:00
..
baby-llama
beam_search	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
benchmark
convert-llama2c-to-ggml	minor : fix trailing whitespace	2023-08-23 22:37:39 +03:00
embd-input	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
embedding	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
gguf	gguf : add 64-bit support (GGUF v2) (#2821 )	2023-08-27 14:19:54 +03:00
gptneox-wip
jeopardy
llama-bench	llama-bench : add model sizes (#2771 )	2023-08-25 15:16:19 +02:00
main	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
metal
perplexity	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
quantize
quantize-stats
save-load-state	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
server	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
simple	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
train-text-from-scratch	llama : more tokenizer fixes (#2810 )	2023-08-27 14:19:19 +03:00
alpaca.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt	llama : add llama_beam_search() (#2267 )	2023-08-25 18:18:48 +03:00
gpt4all.sh
json-schema-to-grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
reason-act.sh
server-llama2-13B.sh