llama_cpp_for_radxa_dragon_wing_q6a

Georgi Gerganov 196f5083ef common : more accurate sampling timing (#17382)
* common : more accurate sampling timing
* eval-callback : minor fixes
* cont : add time_meas impl
* cont : fix log msg [no ci]
* cont : fix multiple definitions of time_meas
* llama-cli : exclude chat template init from time measurement
* cont : print percentage of unaccounted time
* cont : do not reset timings
2025-11-20 13:40:10 +02:00
batched
batched.swift
convert-llama2c-to-ggml
deprecation-warning
diffusion
embedding
eval-callback	common : more accurate sampling timing (#17382)	2025-11-20 13:40:10 +02:00
gen-docs
gguf	examples(gguf): GGUF example outputs (#17025)	2025-11-05 19:58:16 +02:00
gguf-hash
llama.android
llama.swiftui
lookahead
lookup
model-conversion
parallel
passkey
retrieval
save-load-state
simple
simple-chat
simple-cmake-pkg
speculative
speculative-simple
sycl
training
CMakeLists.txt
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh