llama_cpp_for_radxa_dragon_.../tools
Pascal 5f7e166cbf
Fix thinking blocks with quotes + add handling [THINK]...[/THINK] blocks (#16326)
* fix: prevent reasoning blocks with quotes from being truncated

* chore: update webui build output

* feat: Improve thinking content parsing

* test: Adds ChatMessage component stories for different thinking blocks

* chore: update webui build output

* fix: ChatMessage story fix

---------

Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>
2025-09-29 18:49:47 +02:00
..
batched-bench
cvector-generator
export-lora
gguf-split ci : use smaller model (#16168) 2025-09-22 09:11:39 +03:00
imatrix
llama-bench llama-bench: add --devices and --list-devices support (#16039) 2025-09-20 00:15:21 +02:00
main llama-cli: prevent spurious assistant token (#16202) 2025-09-29 10:03:12 +03:00
mtmd mtmd : fix uninitialized variable in bicubic_resize (#16275) 2025-09-26 15:00:44 +02:00
perplexity perplexity : show more kl-divergence data (#16321) 2025-09-29 09:30:45 +03:00
quantize ci : use smaller model (#16168) 2025-09-22 09:11:39 +03:00
rpc
run
server Fix thinking blocks with quotes + add handling [THINK]...[/THINK] blocks (#16326) 2025-09-29 18:49:47 +02:00
tokenize
tts
CMakeLists.txt