llama_cpp_for_radxa_dragon_.../examples
Copilot d8914fc47e
common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191)
* Checkpoint from VS Code for coding agent session

* Initial plan

* Fix typo in --override-tensor-draft flag implementation

* Add null termination for speculative tensor buffer overrides

* Apply suggestions from code review

* Apply suggestions from code review

* Extract tensor override parsing logic to common function (addresses @slaren's feedback)

* Apply suggestions from code review

* Apply suggestions

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: Diego Devesa <slarengh@gmail.com>
2025-08-13 12:44:40 +02:00
..
batched
batched.swift llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
convert-llama2c-to-ggml
deprecation-warning
diffusion Add LLaDA 8b Diffusion model (#14771) 2025-07-31 19:49:09 +08:00
embedding tests : update for LLAMA_SET_ROWS=1 (#14961) 2025-07-30 15:12:02 +03:00
eval-callback eval-callback : check for empty input (#14539) 2025-07-05 07:18:09 +03:00
gen-docs
gguf
gguf-hash
gritlm llama : rework embeddings logic (#14208) 2025-06-16 14:14:00 +03:00
jeopardy scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
llama.android llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
llama.swiftui llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
lookahead llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
lookup llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
parallel parallel : add option for different RNG seeds (#14757) 2025-07-18 17:33:41 +03:00
passkey llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
retrieval llama : deprecate llama_kv_self_ API (#14030) 2025-06-06 14:11:15 +03:00
save-load-state tests : update for LLAMA_SET_ROWS=1 (#14961) 2025-07-30 15:12:02 +03:00
simple fix: check model pointer validity before use (#13631) 2025-05-19 13:25:41 +03:00
simple-chat simple-chat : fix context-exceeded condition (#14494) 2025-07-02 14:12:07 +03:00
simple-cmake-pkg
speculative common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191) 2025-08-13 12:44:40 +02:00
speculative-simple common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191) 2025-08-13 12:44:40 +02:00
sycl scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
training examples/training: Fix file name in README (#13803) 2025-05-26 16:55:24 +02:00
chat-13B.bat
chat-13B.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat-persistent.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat-vicuna.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
chat.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
CMakeLists.txt Support diffusion models: Add Dream 7B (#14644) 2025-07-16 20:03:51 +08:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
llm.vim
Miku.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
regex_to_grammar.py
server-llama2-13B.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00
server_embd.py
ts-type-to-grammar.sh scripts : make the shell scripts cross-platform (#14341) 2025-06-30 10:17:18 +02:00