peg-parser
common : implement new jinja template engine ( #18462 )
2026-01-16 11:22:06 +01:00
.gitignore
common : introduce composable PEG parser combinators for chat parsing ( #17136 )
2025-12-03 12:45:32 +02:00
CMakeLists.txt
ci : run test-jinja -py on high perf [no ci] ( #18916 )
2026-01-19 20:29:15 +01:00
get-model.cpp
get-model.h
run-json-schema-to-grammar.mjs
test-alloc.cpp
test-arg-parser.cpp
ci, tests : use cmake to download models and remove libcurl dependency ( #18791 )
2026-01-14 07:46:27 +01:00
test-autorelease.cpp
test-backend-ops.cpp
CUDA: fix padding of GQA to power of 2 in FA ( #19115 )
2026-01-26 23:24:58 +01:00
test-backend-sampler.cpp
tests : refactor test-backend-sampler ( #18753 )
2026-01-11 17:31:03 +02:00
test-barrier.cpp
Fix race conditions in threadpool when dealing with dynamic/frequent n_threads changes ( #17748 )
2025-12-10 12:32:23 -08:00
test-c.c
test-chat-parser.cpp
cli : fix reasoning responses in CLI ( #18961 )
2026-01-20 18:23:25 +01:00
test-chat-peg-parser.cpp
cli : fix reasoning responses in CLI ( #18961 )
2026-01-20 18:23:25 +01:00
test-chat-template.cpp
jinja : implement mixed type object keys ( #18955 )
2026-01-27 19:50:42 +01:00
test-chat.cpp
server : support preserving reasoning_content in assistant message ( #18994 )
2026-01-22 21:30:06 +01:00
test-double-float.cpp
test-gbnf-validator.cpp
test-gguf.cpp
GGUF: check that tensor size is representable ( #19072 )
2026-01-24 21:57:51 +01:00
test-grammar-integration.cpp
llama : add token matching support to llama-grammar ( #17816 )
2025-12-09 00:32:57 -06:00
test-grammar-llguidance.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-grammar-parser.cpp
llama : add token matching support to llama-grammar ( #17816 )
2025-12-09 00:32:57 -06:00
test-jinja.cpp
jinja : undefined should be treated as sequence/iterable (return string/array) by filters/tests ( #19147 )
2026-01-28 14:40:29 +01:00
test-json-partial.cpp
test-json-schema-to-grammar.cpp
common : add nemotron 3 parsing ( #18077 )
2025-12-16 04:05:23 -06:00
test-llama-grammar.cpp
llama : add token matching support to llama-grammar ( #17816 )
2025-12-09 00:32:57 -06:00
test-log.cpp
test-lora-conversion-inference.sh
cli: new CLI experience ( #17824 )
2025-12-10 15:28:59 +01:00
test-model-load-cancel.cpp
test-mtmd-c-api.c
test-opt.cpp
test-peg-parser.cpp
common : introduce composable PEG parser combinators for chat parsing ( #17136 )
2025-12-03 12:45:32 +02:00
test-quantize-fns.cpp
test-quantize-perf.cpp
test-quantize-stats.cpp
server: introduce API for serving / loading / unloading multiple models ( #17470 )
2025-12-01 19:41:04 +01:00
test-regex-partial.cpp
common/grammar : replace problematic backtracking regex [\s\S]* ( #18342 )
2026-01-03 16:02:43 -06:00
test-rope.cpp
test-sampling.cpp
test-state-restore-fragmented.cpp
kv-cache: Fix state restore fragmented cache ( #17982 )
2025-12-15 19:28:35 +02:00
test-thread-safety.cpp
test-tokenizer-0.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-tokenizer-0.py
test-tokenizer-0.sh
test-tokenizer-1-bpe.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-tokenizer-1-spm.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-tokenizer-random.py
test-tokenizers-repo.sh
testing.h
common : implement new jinja template engine ( #18462 )
2026-01-16 11:22:06 +01:00