llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

Andrew Godfrey 73bdcb395e finetune : add -ngl parameter (#3762)	2023-11-01 13:49:04 +02:00

* Add '-ngl' support to finetune.cpp
* Add fprintf in ggml_cuda_op_add
  When I tried CUDA offloading during finetuning following the readme, I got an assert here. This probably isn't an important case because inference later gives a warning saying you should use f16 or f32 instead when using lora
* Add 'finetune.sh', which currently fails when using GPU
  "error: operator (): Finetuning on tensors with type 'f16' is not yet supported"
* tweak finetune.sh
* Suppress some warnings in ggml.c
* Add f16 implementation to ggml_compute_forward_add_f16_f32
* Add an f16 case to ggml_add_cast_impl and llama_build_lora_finetune_graphs
* finetune.sh: Edit comments
* Add "add_f16_f32_f32_cuda"
* Tweak an error message
* finetune.sh: Add an optional LLAMA_MODEL_DIR variable
* finetune.sh: Add an optional LLAMA_TRAINING_DIR variable
* train : minor
* tabs to spaces

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>
baby-llama
batched
batched-bench	Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843)	2023-10-29 11:31:40 -06:00
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml
embedding
export-lora
finetune	finetune : add -ngl parameter (#3762)	2023-11-01 13:49:04 +02:00
gguf
infill
jeopardy
llama-bench	Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843)	2023-10-29 11:31:40 -06:00
llava
main	samplers : Min-P sampler implementation [alternative to Top P/Top K] (#3841)	2023-10-31 20:44:49 +01:00
main-cmake-pkg
metal
parallel
perplexity	Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843)	2023-10-29 11:31:40 -06:00
quantize	ggml : quantization refactoring (#3833)	2023-10-29 18:32:28 +02:00
quantize-stats
save-load-state
server	server : re-enable completion and embedded at the same time (#3876)	2023-11-01 11:28:28 +02:00
simple
speculative
train-text-from-scratch
alpaca.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gpt4all.sh
json-schema-to-grammar.py
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
reason-act.sh
server-llama2-13B.sh