llama_cpp_for_radxa_dragon_wing_q6a

Pascal 5f7e166cbf Fix thinking blocks with quotes + add handling `[THINK]...[/THINK]` blocks (#16326)	2025-09-29 18:49:47 +02:00

* fix: prevent reasoning blocks with quotes from being truncated
* chore: update webui build output
* feat: Improve thinking content parsing
* test: Adds ChatMessage component stories for different thinking blocks
* chore: update webui build output
* fix: ChatMessage story fix

Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>

batched-bench
cvector-generator
export-lora
gguf-split	ci : use smaller model (#16168)	2025-09-22 09:11:39 +03:00
imatrix
llama-bench	llama-bench: add --devices and --list-devices support (#16039)	2025-09-20 00:15:21 +02:00
main	llama-cli: prevent spurious assistant token (#16202)	2025-09-29 10:03:12 +03:00
mtmd	mtmd : fix uninitialized variable in bicubic_resize (#16275)	2025-09-26 15:00:44 +02:00
perplexity	perplexity : show more kl-divergence data (#16321)	2025-09-29 09:30:45 +03:00
quantize	ci : use smaller model (#16168)	2025-09-22 09:11:39 +03:00
rpc
run
server	Fix thinking blocks with quotes + add handling `[THINK]...[/THINK]` blocks (#16326)	2025-09-29 18:49:47 +02:00
tokenize
tts
CMakeLists.txt