llama_cpp_for_radxa_dragon_.../examples
Kawrakow 326b418b59
Importance Matrix calculation (#4861)
* imatrix: 1st version

* imatrix: WIP

* Cleanup

* Update examples/imatrix/imatrix.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2024-01-12 06:59:57 +01:00
baby-llama
batched
batched-bench
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml
embedding
export-lora
finetune
gguf
imatrix Importance Matrix calculation (#4861) 2024-01-12 06:59:57 +01:00
infill
jeopardy
llama-bench
llama.swiftui llama.swiftui : update readme 2024-01-08 15:57:36 +02:00
llava clip : support more quantization types (#4846) 2024-01-10 15:37:09 +02:00
lookahead
lookup
main main : better name for variable n_print (#4874) 2024-01-11 22:46:26 +02:00
main-cmake-pkg
metal
parallel
passkey
perplexity
quantize llama : restore intended k-quants mixes for MoE models (#4872) 2024-01-11 21:43:15 +02:00
quantize-stats
save-load-state
server server : fix infill when prompt is empty (#4833) 2024-01-11 23:23:49 +02:00
simple
speculative
tokenize
train-text-from-scratch
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt Importance Matrix calculation (#4861) 2024-01-12 06:59:57 +01:00
gpt4all.sh
json-schema-to-grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
reason-act.sh
server-llama2-13B.sh