llama_cpp_for_radxa_dragon_wing_q6a

History

Aidan eeee367de5 server: fix correct time_ms calculation in prompt_progress (#17093 ) * fix: correct time_ms calculation in send_partial_response The time_ms field was incorrectly calculated. The division was happening before the subtraction leading to incorrect values. Before: (ggml_time_us() - slot.t_start_process_prompt / 1000) After: (ggml_time_us() - slot.t_start_process_prompt) / 1000 * docs : document time_ms field in prompt_progress		2025-11-08 15:12:11 +02:00
..
batched-bench
cvector-generator
export-lora
gguf-split
imatrix
llama-bench	bench : cache the llama_context state at computed depth (#16944 )	2025-11-07 21:23:11 +02:00
main
mtmd	hparams : add n_embd_inp() to support extended embed (#16928 )	2025-11-07 19:27:58 +01:00
perplexity
quantize
rpc
run
server	server: fix correct time_ms calculation in prompt_progress (#17093 )	2025-11-08 15:12:11 +02:00
tokenize
tts
CMakeLists.txt