llama_cpp_for_radxa_dragon_.../tests
Georgi Gerganov a5eaa1d6a3
mla : make the V tensor a view of K (#18986)
* mla : pass V as a view of K to the FA op

* cuda : adjust mla logic to new layout

* kv-cache : fix rope shift

* tests : remove comment

* cuda : fix reusable_cutoff

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2026-01-22 22:09:01 +02:00
..
peg-parser
.gitignore
CMakeLists.txt
get-model.cpp
get-model.h
run-json-schema-to-grammar.mjs
test-alloc.cpp
test-arg-parser.cpp
test-autorelease.cpp
test-backend-ops.cpp mla : make the V tensor a view of K (#18986) 2026-01-22 22:09:01 +02:00
test-backend-sampler.cpp
test-barrier.cpp
test-c.c
test-chat-parser.cpp cli : fix reasoning responses in CLI (#18961) 2026-01-20 18:23:25 +01:00
test-chat-peg-parser.cpp cli : fix reasoning responses in CLI (#18961) 2026-01-20 18:23:25 +01:00
test-chat-template.cpp
test-chat.cpp cli : fix reasoning responses in CLI (#18961) 2026-01-20 18:23:25 +01:00
test-double-float.cpp
test-gbnf-validator.cpp
test-gguf.cpp
test-grammar-integration.cpp
test-grammar-llguidance.cpp
test-grammar-parser.cpp
test-jinja.cpp jinja: support none|string (#18995) 2026-01-21 19:24:37 +01:00
test-json-partial.cpp
test-json-schema-to-grammar.cpp
test-llama-grammar.cpp
test-log.cpp
test-lora-conversion-inference.sh
test-model-load-cancel.cpp
test-mtmd-c-api.c
test-opt.cpp
test-peg-parser.cpp
test-quantize-fns.cpp
test-quantize-perf.cpp
test-quantize-stats.cpp
test-regex-partial.cpp
test-rope.cpp
test-sampling.cpp
test-state-restore-fragmented.cpp
test-thread-safety.cpp
test-tokenizer-0.cpp
test-tokenizer-0.py
test-tokenizer-0.sh
test-tokenizer-1-bpe.cpp
test-tokenizer-1-spm.cpp
test-tokenizer-random.py
test-tokenizers-repo.sh
testing.h