llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

Daniel Bevenius 56b4795842 model-conversion : add support for SentenceTransformers (#16387)	2025-10-09 14:35:22 +02:00

* model-conversion : add support for SentenceTransformers

This commit adds support for models that use SentenceTransformer layers.

The motivation for this is that if a converted model includes any of the numbered layers specified in the original model's repository, then these changes enable such models to be used and verified. Currently the model-conversion example only supports the base model output, without any of the additional transformation layers.

Usage:

Convert the model that also includes the SentenceTransformer layers:

```console
(venv) $ export EMBEDDING_MODEL_PATH="~/google/embeddinggemma-300M"
(venv) make embedding-convert-model
```

Verify the produced embeddings from the converted model against the original model embeddings:

```console
(venv) make embedding-verify-logits-st
```

The original model can be run using SentenceTransformer:

```console
(venv) make embedding-run-original-model-st
```

Run the converted model using "SentenceTransformer" layers, which enables pooling and normalization:

```console
(venv) make embedding-run-converted-model-st
```

* add model-conversion example requirements

* add support for -st flag in embedding model conversion

This commit adds support for the -st flag in the embedding model conversion script. This will enable models to be converted using sentence transformers dense layers.
batched
batched.swift
convert-llama2c-to-ggml	gguf: gguf_writer refactor (#15691)	2025-09-05 11:34:28 +02:00
deprecation-warning
diffusion	Add LLaDA-7b-MoE diffusion model (#16003)	2025-09-16 10:38:28 +08:00
embedding	llama : add support for qwen3 reranker (#15824)	2025-09-25 11:53:09 +03:00
eval-callback	devops: add s390x & ppc64le CI (#15925)	2025-09-27 02:03:33 +08:00
gen-docs
gguf
gguf-hash
llama.android
llama.swiftui
lookahead
lookup
model-conversion	model-conversion : add support for SentenceTransformers (#16387)	2025-10-09 14:35:22 +02:00
parallel
passkey
retrieval
save-load-state
simple	examples : support encoder-decoder models in the simple example (#16002)	2025-09-17 10:29:00 +03:00
simple-chat
simple-cmake-pkg
speculative
speculative-simple
sycl
training
CMakeLists.txt	codeowners : update + cleanup (#16174)	2025-09-22 18:20:21 +03:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py	json : support `enum` values within `allOf` (#15830)	2025-09-08 16:14:32 -05:00
llama.vim
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh