llama_cpp_for_radxa_dragon_wing_q6a

Georgi Gerganov d69d777c02 ggml : quantization refactoring (#3833)	2023-10-29 18:32:28 +02:00

* ggml : factor all quantization code in ggml-quants

ggml-ci

* ggml-quants : fix Zig and Swift builds + quantize tool

ggml-ci

* quantize : --pure option for disabling k-quant mixtures

---------

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>

baby-llama
batched
batched-bench
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml
embedding
export-lora
finetune
gguf
infill
jeopardy
llama-bench
llava
main
main-cmake-pkg
metal
parallel
perplexity
quantize	ggml : quantization refactoring (#3833)	2023-10-29 18:32:28 +02:00
quantize-stats
save-load-state
server
simple
speculative
train-text-from-scratch
alpaca.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gpt4all.sh
json-schema-to-grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
reason-act.sh
server-llama2-13B.sh