peg-parser
common : introduce composable PEG parser combinators for chat parsing ( #17136 )
2025-12-03 12:45:32 +02:00
.gitignore
common : introduce composable PEG parser combinators for chat parsing ( #17136 )
2025-12-03 12:45:32 +02:00
CMakeLists.txt
tests : refactor test-backend-sampler ( #18753 )
2026-01-11 17:31:03 +02:00
get-model.cpp
get-model.h
run-json-schema-to-grammar.mjs
test-alloc.cpp
ggml : fix graph reallocation with multiple chunks ( #16396 )
2025-10-03 13:49:08 +02:00
test-arg-parser.cpp
vendor : update cpp-httplib to 0.30.0 ( #18660 )
2026-01-08 13:53:54 +01:00
test-autorelease.cpp
test-backend-ops.cpp
vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) ( #18678 )
2026-01-12 12:32:13 +01:00
test-backend-sampler.cpp
tests : refactor test-backend-sampler ( #18753 )
2026-01-11 17:31:03 +02:00
test-barrier.cpp
Fix race conditions in threadpool when dealing with dynamic/frequent n_threads changes ( #17748 )
2025-12-10 12:32:23 -08:00
test-c.c
test-chat-parser.cpp
common : handle unicode during partial json parsing ( #16526 )
2025-10-12 16:18:47 +03:00
test-chat-peg-parser.cpp
common : introduce composable PEG parser combinators for chat parsing ( #17136 )
2025-12-03 12:45:32 +02:00
test-chat-template.cpp
chat : Granite Docling stopping ( #16438 )
2025-10-06 18:59:40 +02:00
test-chat.cpp
chat: make tool description and parameters optional per OpenAI spec ( #18478 )
2025-12-31 17:21:37 -06:00
test-double-float.cpp
test-gbnf-validator.cpp
test-gguf.cpp
test-grammar-integration.cpp
llama : add token matching support to llama-grammar ( #17816 )
2025-12-09 00:32:57 -06:00
test-grammar-llguidance.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-grammar-parser.cpp
llama : add token matching support to llama-grammar ( #17816 )
2025-12-09 00:32:57 -06:00
test-json-partial.cpp
common : handle unicode during partial json parsing ( #16526 )
2025-10-12 16:18:47 +03:00
test-json-schema-to-grammar.cpp
common : add nemotron 3 parsing ( #18077 )
2025-12-16 04:05:23 -06:00
test-llama-grammar.cpp
llama : add token matching support to llama-grammar ( #17816 )
2025-12-09 00:32:57 -06:00
test-log.cpp
test-lora-conversion-inference.sh
cli: new CLI experience ( #17824 )
2025-12-10 15:28:59 +01:00
test-model-load-cancel.cpp
test-mtmd-c-api.c
test-opt.cpp
test-peg-parser.cpp
common : introduce composable PEG parser combinators for chat parsing ( #17136 )
2025-12-03 12:45:32 +02:00
test-quantize-fns.cpp
test-quantize-perf.cpp
test-quantize-stats.cpp
server: introduce API for serving / loading / unloading multiple models ( #17470 )
2025-12-01 19:41:04 +01:00
test-regex-partial.cpp
common/grammar : replace problematic backtracking regex [\s\S]* ( #18342 )
2026-01-03 16:02:43 -06:00
test-rope.cpp
ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 ( #16805 )
2025-11-11 13:33:24 +02:00
test-sampling.cpp
test-state-restore-fragmented.cpp
kv-cache: Fix state restore fragmented cache ( #17982 )
2025-12-15 19:28:35 +02:00
test-thread-safety.cpp
server : support unified cache across slots ( #16736 )
2025-11-02 18:14:04 +02:00
test-tokenizer-0.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-tokenizer-0.py
test-tokenizer-0.sh
test-tokenizer-1-bpe.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-tokenizer-1-spm.cpp
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
test-tokenizer-random.py
test-tokenizers-repo.sh