llama_cpp_for_radxa_dragon_wing_q6a

Georgi Gerganov 39173bcacb context : reserve new scheduler when graph topology changes (#18547)	2026-01-15 16:39:17 +02:00
* context : reserve new scheduler when graph topology changes
* cont : fix
* cont : fix reserve
* cont : reserve only when changes occur + timing
* context : add comments
* llama : reserve on sampler changes
* common : allow null common_sampler
* server : task declares needs (embd, logits, sampling)
* server : do not init sampler if not needed
* llama : fix need_reserve when unsetting a sampler
* server : consolidate slot reset/clear logic
batched	context : reserve new scheduler when graph topology changes (#18547)	2026-01-15 16:39:17 +02:00
batched.swift
convert-llama2c-to-ggml
debug
deprecation-warning
diffusion
embedding
eval-callback	tests : download models only when running ctest (#18843)	2026-01-15 09:47:29 +01:00
gen-docs
gguf
gguf-hash
idle
llama.android
llama.swiftui
lookahead
lookup
model-conversion
parallel
passkey
retrieval
save-load-state
simple
simple-chat
simple-cmake-pkg
speculative
speculative-simple
sycl
training
CMakeLists.txt
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh