llama_cpp_for_radxa_dragon_wing_q6a

History

Kawrakow 326b418b59 Importance Matrix calculation (#4861 ) * imatrix: 1st version * imatrix: WIP * Cleanup * Update examples/imatrix/imatrix.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>		2024-01-12 06:59:57 +01:00
..
baby-llama
batched
batched-bench
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml
embedding
export-lora
finetune
gguf
imatrix	Importance Matrix calculation (#4861 )	2024-01-12 06:59:57 +01:00
infill
jeopardy
llama-bench
llama.swiftui	llama.swiftui : update readme	2024-01-08 15:57:36 +02:00
llava	clip : support more quantization types (#4846 )	2024-01-10 15:37:09 +02:00
lookahead
lookup
main	main : better name for variable n_print (#4874 )	2024-01-11 22:46:26 +02:00
main-cmake-pkg
metal
parallel
passkey
perplexity
quantize	llama : restore intended k-quants mixes for MoE models (#4872 )	2024-01-11 21:43:15 +02:00
quantize-stats
save-load-state
server	server : fix infill when prompt is empty (#4833 )	2024-01-11 23:23:49 +02:00
simple
speculative
tokenize
train-text-from-scratch
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt	Importance Matrix calculation (#4861 )	2024-01-12 06:59:57 +01:00
gpt4all.sh
json-schema-to-grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
reason-act.sh
server-llama2-13B.sh