llama_cpp_for_radxa_dragon_.../common
Jillis ter Hove 52e5f0a5c1
common : re-arm reasoning budget after DONE on new <think> (#22323)
DONE state absorbs all tokens including a new start tag, causing any think blocks after the first to run unbudgeted. Observed on unsloth/Qwen3.6-27B-GGUF which interleaves multiple <think> blocks per response.

Fixed by advancing start_matcher in DONE branch and re-arming to COUNTING with a fresh budget on match. Adds regression test (test-reasoning-budget: test 6).
2026-04-28 19:15:36 +02:00
..
jinja common : fix jinja warnings with clang 21 (#22313) 2026-04-24 12:36:02 +02:00
arg.cpp spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
arg.h spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
base64.hpp
build-info.cpp.in
build-info.h
chat-auto-parser-generator.cpp
chat-auto-parser-helpers.cpp
chat-auto-parser-helpers.h
chat-auto-parser.h
chat-diff-analyzer.cpp chat: fix handling of space in reasoning markers (#22353) 2026-04-25 21:24:13 +02:00
chat-peg-parser.cpp
chat-peg-parser.h
chat.cpp server: Enable transcriptions API for LFM2-Audio (#22000) 2026-04-23 10:47:26 +02:00
chat.h server: Enable transcriptions API for LFM2-Audio (#22000) 2026-04-23 10:47:26 +02:00
CMakeLists.txt
common.cpp spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
common.h spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
console.cpp
console.h
debug.cpp common: fix missing exports in llama-common (#22340) 2026-04-27 08:06:39 +03:00
debug.h common: fix missing exports in llama-common (#22340) 2026-04-27 08:06:39 +03:00
download.cpp download : prefer q8_0 when q4_k not available (#22428) 2026-04-27 14:30:29 +02:00
download.h
fit.cpp Fix type casting for unaccounted memory calculation (#22424) 2026-04-27 14:31:13 +02:00
fit.h
hf-cache.cpp
hf-cache.h
http.h
json-partial.cpp
json-partial.h
json-schema-to-grammar.cpp
json-schema-to-grammar.h
llguidance.cpp
log.cpp
log.h
ngram-cache.cpp
ngram-cache.h
ngram-map.cpp
ngram-map.h
ngram-mod.cpp
ngram-mod.h
peg-parser.cpp
peg-parser.h
preset.cpp spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
preset.h
reasoning-budget.cpp common : re-arm reasoning budget after DONE on new <think> (#22323) 2026-04-28 19:15:36 +02:00
reasoning-budget.h
regex-partial.cpp
regex-partial.h
sampling.cpp
sampling.h
speculative.cpp spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
speculative.h spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
unicode.cpp
unicode.h