llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Maximilian Werk 66287bdaac model : add Jina Embeddings v5 Nano (partial EuroBERT) support (#19826 ) * WIP: Add EuroBERT support with autoformatting changes This commit includes: - EuroBERT model implementation for GGUF conversion - C++ backend support for EuroBERT architecture - Unintended autoformatting changes to Python files Saving before reverting formatting-only changes. * feat: add back eos assert when not last token pooling * feat: removed duplicated code and cleanup * feat: removed not working architectures and unnecessary check * fix: typo * fix: dynamic pooling config * feat: added an example model for eurobert * feat: proper llama-vocab implementation for jina-v5 * fix: removed unnecessary comments		2026-02-26 12:14:09 +01:00
..
batched-bench
cli	cli : provide model with text filename (#19783 )	2026-02-22 22:33:49 +01:00
completion	llama : remove write/read of output ids/logits/embeddings (#18862 )	2026-02-23 07:04:30 +01:00
cvector-generator
export-lora
fit-params
gguf-split
imatrix	model : add Jina Embeddings v5 Nano (partial EuroBERT) support (#19826 )	2026-02-26 12:14:09 +01:00
llama-bench
mtmd	model: Add PaddleOCR-VL model support (#18825 )	2026-02-19 17:05:25 +01:00
perplexity	perplexity: add proper batching (#19661 )	2026-02-16 18:44:44 +02:00
quantize	quantize : add --dry-run option (#19526 )	2026-02-20 09:20:16 +01:00
rpc	NetBSD build support (#19589 )	2026-02-14 09:47:01 +01:00
server	server : fix typo in server README.md (#19900 )	2026-02-26 11:26:16 +01:00
tokenize
tts
CMakeLists.txt