llama_cpp_for_radxa_dragon_.../tests
Aman Gupta 1e38a7a6fa
CUDA: use shared mem for ssm_conv (#20128)
* CUDA: use shared mem for ssm_conv

* fuse silu + ssm_conv

* fuse unary + mul

* enable for fp16

* formatting

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2026-03-06 23:09:59 +08:00
..
peg-parser common : implement new jinja template engine (#18462) 2026-01-16 11:22:06 +01:00
.gitignore
CMakeLists.txt tests : model metadata loading from huggingface (#19796) 2026-02-28 10:44:38 +01:00
get-model.cpp
get-model.h
gguf-model-data.cpp tests : model metadata loading from huggingface (#19796) 2026-02-28 10:44:38 +01:00
gguf-model-data.h tests : model metadata loading from huggingface (#19796) 2026-02-28 10:44:38 +01:00
run-json-schema-to-grammar.mjs
test-alloc.cpp chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
test-arg-parser.cpp ci, tests : use cmake to download models and remove libcurl dependency (#18791) 2026-01-14 07:46:27 +01:00
test-autorelease.cpp docs : Minor cleanups (#19252) 2026-02-02 08:38:55 +02:00
test-backend-ops.cpp CUDA: use shared mem for ssm_conv (#20128) 2026-03-06 23:09:59 +08:00
test-backend-sampler.cpp tests : fix typos in comments in test-backend-sampler [no ci] (#19824) 2026-02-23 17:12:02 +01:00
test-barrier.cpp
test-c.c
test-chat-parser.cpp cli : fix reasoning responses in CLI (#18961) 2026-01-20 18:23:25 +01:00
test-chat-peg-parser.cpp cli : fix reasoning responses in CLI (#18961) 2026-01-20 18:23:25 +01:00
test-chat-template.cpp jinja : do not pass empty tools and add some none filters (#19176) 2026-01-29 14:06:54 +01:00
test-chat.cpp chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
test-double-float.cpp
test-gbnf-validator.cpp
test-gguf-model-data.cpp tests : model metadata loading from huggingface (#19796) 2026-02-28 10:44:38 +01:00
test-gguf.cpp ggml/gguf : prevent integer overflows (#19856) 2026-02-24 20:17:11 +02:00
test-grammar-integration.cpp
test-grammar-llguidance.cpp tool/ex/tests: consistently free ctx, then model (#18168) 2025-12-22 11:00:37 +01:00
test-grammar-parser.cpp
test-jinja.cpp jinja: correct stats for tojson and string filters (#19785) 2026-02-22 21:08:23 +01:00
test-json-partial.cpp
test-json-schema-to-grammar.cpp
test-llama-grammar.cpp
test-log.cpp
test-lora-conversion-inference.sh
test-model-load-cancel.cpp
test-mtmd-c-api.c
test-opt.cpp
test-peg-parser.cpp
test-quantize-fns.cpp
test-quantize-perf.cpp
test-quantize-stats.cpp
test-regex-partial.cpp common/grammar : replace problematic backtracking regex [\s\S]* (#18342) 2026-01-03 16:02:43 -06:00
test-rope.cpp
test-sampling.cpp
test-state-restore-fragmented.cpp
test-thread-safety.cpp
test-tokenizer-0.cpp tool/ex/tests: consistently free ctx, then model (#18168) 2025-12-22 11:00:37 +01:00
test-tokenizer-0.py
test-tokenizer-0.sh model : add Jina Embeddings v5 Nano (partial EuroBERT) support (#19826) 2026-02-26 12:14:09 +01:00
test-tokenizer-1-bpe.cpp tool/ex/tests: consistently free ctx, then model (#18168) 2025-12-22 11:00:37 +01:00
test-tokenizer-1-spm.cpp tool/ex/tests: consistently free ctx, then model (#18168) 2025-12-22 11:00:37 +01:00
test-tokenizer-random.py
test-tokenizers-repo.sh
testing.h common : implement new jinja template engine (#18462) 2026-01-16 11:22:06 +01:00