llama_cpp_for_radxa_dragon_.../tools
Aidan eeee367de5
server: fix correct time_ms calculation in prompt_progress (#17093)
* fix: correct time_ms calculation in send_partial_response

The time_ms field was incorrectly calculated. The division was happening
before the subtraction leading to incorrect values.

Before: (ggml_time_us() - slot.t_start_process_prompt / 1000) After:
(ggml_time_us() - slot.t_start_process_prompt) / 1000

* docs : document time_ms field in prompt_progress
2025-11-08 15:12:11 +02:00
..
batched-bench
cvector-generator
export-lora
gguf-split
imatrix
llama-bench bench : cache the llama_context state at computed depth (#16944) 2025-11-07 21:23:11 +02:00
main
mtmd hparams : add n_embd_inp() to support extended embed (#16928) 2025-11-07 19:27:58 +01:00
perplexity
quantize
rpc
run
server server: fix correct time_ms calculation in prompt_progress (#17093) 2025-11-08 15:12:11 +02:00
tokenize
tts
CMakeLists.txt