llama_cpp_for_radxa_dragon_.../tools
Maximilian Werk 66287bdaac
model : add Jina Embeddings v5 Nano (partial EuroBERT) support (#19826)
* WIP: Add EuroBERT support with autoformatting changes

This commit includes:
- EuroBERT model implementation for GGUF conversion
- C++ backend support for EuroBERT architecture
- Unintended autoformatting changes to Python files

Saving before reverting formatting-only changes.

* feat: add back eos assert when not last token pooling

* feat: removed duplicated code and cleanup

* feat: removed not working architectures and unnecessary check

* fix: typo

* fix: dynamic pooling config

* feat: added an example model for eurobert

* feat: proper llama-vocab implementation for jina-v5

* fix: removed unnecessary comments
2026-02-26 12:14:09 +01:00
Name               Last commit                                                              Date
batched-bench
cli                cli : provide model with text filename (#19783)                          2026-02-22 22:33:49 +01:00
completion         llama : remove write/read of output ids/logits/embeddings (#18862)       2026-02-23 07:04:30 +01:00
cvector-generator
export-lora
fit-params
gguf-split
imatrix            model : add Jina Embeddings v5 Nano (partial EuroBERT) support (#19826)  2026-02-26 12:14:09 +01:00
llama-bench
mtmd               model: Add PaddleOCR-VL model support (#18825)                           2026-02-19 17:05:25 +01:00
perplexity         perplexity: add proper batching (#19661)                                 2026-02-16 18:44:44 +02:00
quantize           quantize : add --dry-run option (#19526)                                 2026-02-20 09:20:16 +01:00
rpc                NetBSD build support (#19589)                                            2026-02-14 09:47:01 +01:00
server             server : fix typo in server README.md (#19900)                           2026-02-26 11:26:16 +01:00
tokenize
tts
CMakeLists.txt