llama_cpp_for_radxa_dragon_.../tools
Xuan-Son Nguyen 32916a4907
clip : refactor graph builder (#13321)
* mtmd : refactor graph builder

* fix qwen2vl

* clean up siglip cgraph

* pixtral migrated

* move minicpmv to a dedicated build function

* move max_feature_layer to build_llava

* use build_attn for minicpm resampler

* fix windows build

* add comment for batch_size

* also support tinygemma3 test model

* qwen2vl does not use RMS norm

* fix qwen2vl norm (2)
2025-05-06 22:40:24 +02:00
..
batched-bench
cvector-generator
export-lora
gguf-split
imatrix
llama-bench
main
mtmd clip : refactor graph builder (#13321) 2025-05-06 22:40:24 +02:00
perplexity
quantize
rpc
run
server sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (#13264) 2025-05-05 22:12:19 +02:00
tokenize
tts
CMakeLists.txt