llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Copilot d8914fc47e common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191 ) * Checkpoint from VS Code for coding agent session * Initial plan * Fix typo in --override-tensor-draft flag implementation * Add null termination for speculative tensor buffer overrides * Apply suggestions from code review * Apply suggestions from code review * Extract tensor override parsing logic to common function (addresses @slaren's feedback) * Apply suggestions from code review * Apply suggestions --------- Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: Diego Devesa <slarengh@gmail.com>		2025-08-13 12:44:40 +02:00
..
batched
batched.swift	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
convert-llama2c-to-ggml
deprecation-warning
diffusion	Add LLaDA 8b Diffusion model (#14771 )	2025-07-31 19:49:09 +08:00
embedding	tests : update for LLAMA_SET_ROWS=1 (#14961 )	2025-07-30 15:12:02 +03:00
eval-callback	eval-callback : check for empty input (#14539 )	2025-07-05 07:18:09 +03:00
gen-docs
gguf
gguf-hash
gritlm	llama : rework embeddings logic (#14208 )	2025-06-16 14:14:00 +03:00
jeopardy	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
llama.android	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
llama.swiftui	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
lookahead	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
lookup	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
parallel	parallel : add option for different RNG seeds (#14757 )	2025-07-18 17:33:41 +03:00
passkey	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
retrieval	llama : deprecate llama_kv_self_ API (#14030 )	2025-06-06 14:11:15 +03:00
save-load-state	tests : update for LLAMA_SET_ROWS=1 (#14961 )	2025-07-30 15:12:02 +03:00
simple	fix: check model pointer validity before use (#13631 )	2025-05-19 13:25:41 +03:00
simple-chat	simple-chat : fix context-exceeded condition (#14494 )	2025-07-02 14:12:07 +03:00
simple-cmake-pkg
speculative	common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191 )	2025-08-13 12:44:40 +02:00
speculative-simple	common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191 )	2025-08-13 12:44:40 +02:00
sycl	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
training	examples/training: Fix file name in README (#13803 )	2025-05-26 16:55:24 +02:00
chat-13B.bat
chat-13B.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
chat-persistent.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
chat-vicuna.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
chat.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
CMakeLists.txt	Support diffusion models: Add Dream 7B (#14644 )	2025-07-16 20:03:51 +08:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
llm.vim
Miku.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
regex_to_grammar.py
server-llama2-13B.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
server_embd.py
ts-type-to-grammar.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00