llama_cpp_for_radxa_dragon_.../examples
Georgi Gerganov e92d53b29e
sampling : optimize samplers by reusing bucket sort (#15665)
* sampling : optimize sorting using bucket sort in more places

ggml-ci

* sampling : do not sort in dist sampler

ggml-ci

* sampling : avoid heap allocations for sort buffers

ggml-ci

* common : add option to sort sampling candidates by probability

ggml-ci

* sampling : revert the change for preserving sort buffers

* sampling : use std::copy instead of memcpy

* sampling : clarify purpose of partial sort helpers

ggml-ci

* cont : remove wrong comment [no ci]

* common : update comment

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2025-08-31 20:41:02 +03:00
..
batched
batched.swift examples : remove references to make in examples [no ci] (#15457) 2025-08-21 06:12:28 +02:00
convert-llama2c-to-ggml
deprecation-warning
diffusion llama: use FA + max. GPU layers by default (#15434) 2025-08-30 16:32:10 +02:00
embedding
eval-callback fix: Compute the full sum in llama-eval-callback, not just the sum of printed values (#15637) 2025-08-28 15:27:36 -05:00
gen-docs
gguf
gguf-hash
gritlm
jeopardy
llama.android
llama.swiftui
lookahead lookahead : add sample command to readme (#15447) 2025-08-20 13:30:46 +03:00
lookup
model-conversion model-conversion : add mmproj conversion target (#15628) 2025-08-28 09:26:48 +02:00
parallel
passkey examples : remove references to make in examples [no ci] (#15457) 2025-08-21 06:12:28 +02:00
retrieval examples : remove references to make in examples [no ci] (#15457) 2025-08-21 06:12:28 +02:00
save-load-state
simple
simple-chat
simple-cmake-pkg
speculative sampling : optimize samplers by reusing bucket sort (#15665) 2025-08-31 20:41:02 +03:00
speculative-simple
sycl examples : remove references to make in examples [no ci] (#15457) 2025-08-21 06:12:28 +02:00
training
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt examples : add model conversion tool/example (#15455) 2025-08-21 12:16:54 +02:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim llama : remove KV cache defragmentation logic (#15473) 2025-08-22 12:22:13 +03:00
llm.vim
Miku.sh
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh