llama_cpp_for_radxa_dragon_.../tools
Tianyue-Zhao bacddc049a
model: Add support for CogVLM model (#15002)
* Added GGUF mappings for CogVLM model

* Add tensor mapping for CogVLM visual encoder

* Add CogVLM to conversion script, no vision part yet

* Added CogVLM vision model to conversion script

* Add graph for CogVLM CLIP model

* Add graph for CogVLM

* Fixes for CogVLM. Now compiles.

* Model now runs

* Fixes for cogvlm graph

* Account for graph context change after rebase

* Changes for whitespace

* Changes in convert script according to comments

* Switch CogVLM LLM graph to merged QKV tensor

* Use rope_type variable instead of direct definition

* Change CogVLM CLIP encoder to use SWIGLU

* Switch CogVLM CLIP to use merged QKV

* Apply rebase edits and remove ggml_cont call that is now unnecessary

* clean up

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
2025-10-30 12:18:50 +01:00
..
batched-bench
cvector-generator
export-lora
gguf-split
imatrix Manually link -lbsd to resolve flock symbol on AIX (#16610) 2025-10-23 19:37:31 +08:00
llama-bench llama-bench : clarify benchmarked parts of the computation (#16823) 2025-10-28 19:41:43 +02:00
main
mtmd model: Add support for CogVLM model (#15002) 2025-10-30 12:18:50 +01:00
perplexity
quantize
rpc rpc : report actual free memory (#16616) 2025-10-17 18:02:52 +03:00
run Manually link -lbsd to resolve flock symbol on AIX (#16610) 2025-10-23 19:37:31 +08:00
server memory : remove KV cache size padding (#16812) 2025-10-28 20:19:44 +02:00
tokenize
tts
CMakeLists.txt