llama_cpp_for_radxa_dragon_.../examples
Georgi Gerganov d0cee0d36d
gguf : add 64-bit support (GGUF v2) (#2821)
* gguf : bump version to 2

* gguf : add support for 64-bit (no backwards comp yet)

* gguf : v1 backwards comp

* gguf.py : bump GGUF version

* gguf.py : uint64_t on all lengths, sizes and counts, enums still uint32_t

* gguf.py : string lengths uint32_t

* gguf : update all counts to 64-bit

* gguf.py : string len uint64_t and n_dims uint32_t

* gguf : fix typo

* llama.cpp : print gguf version

---------

Co-authored-by: klosax <131523366+klosax@users.noreply.github.com>
2023-08-27 14:19:54 +03:00
..
baby-llama
beam_search llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
benchmark
convert-llama2c-to-ggml minor : fix trailing whitespace 2023-08-23 22:37:39 +03:00
embd-input llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
embedding llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
gguf gguf : add 64-bit support (GGUF v2) (#2821) 2023-08-27 14:19:54 +03:00
gptneox-wip
jeopardy
llama-bench llama-bench : add model sizes (#2771) 2023-08-25 15:16:19 +02:00
main llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
metal
perplexity llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
quantize
quantize-stats
save-load-state llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
server llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
simple llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
train-text-from-scratch llama : more tokenizer fixes (#2810) 2023-08-27 14:19:19 +03:00
alpaca.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt llama : add llama_beam_search() (#2267) 2023-08-25 18:18:48 +03:00
gpt4all.sh
json-schema-to-grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
reason-act.sh
server-llama2-13B.sh