| CMakeLists.txt | Add Jinja template support (#11016) | 2025-01-21 13:18:51 +00:00 |
| llama-adapter.cpp | | |
| llama-adapter.h | | |
| llama-arch.cpp | llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) | 2025-02-02 09:48:46 +02:00 |
| llama-arch.h | Add Jinja template support (#11016) | 2025-01-21 13:18:51 +00:00 |
| llama-batch.cpp | | |
| llama-batch.h | | |
| llama-chat.cpp | llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) | 2025-02-02 09:48:46 +02:00 |
| llama-chat.h | llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) | 2025-02-02 09:48:46 +02:00 |
| llama-context.cpp | | |
| llama-context.h | | |
| llama-cparams.cpp | | |
| llama-cparams.h | | |
| llama-grammar.cpp | nit: more informative crash when grammar sampler fails (#11593) | 2025-02-02 19:58:34 +00:00 |
| llama-grammar.h | Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639) | 2025-01-30 19:13:58 +00:00 |
| llama-hparams.cpp | | |
| llama-hparams.h | | |
| llama-impl.cpp | | |
| llama-impl.h | | |
| llama-kv-cache.cpp | | |
| llama-kv-cache.h | | |
| llama-mmap.cpp | mmap: add include for cerrno (#11296) | 2025-01-20 16:02:43 +02:00 |
| llama-mmap.h | | |
| llama-model-loader.cpp | llama : minor fixes for up llama load model speed (#11448) | 2025-01-27 14:42:09 +01:00 |
| llama-model-loader.h | llama : add llama_model_load_from_splits (#11255) | 2025-01-16 13:54:08 +01:00 |
| llama-model.cpp | llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) | 2025-02-02 09:48:46 +02:00 |
| llama-model.h | rpc : early register backend devices (#11262) | 2025-01-17 10:57:09 +02:00 |
| llama-quant.cpp | llama : add llama_model_load_from_splits (#11255) | 2025-01-16 13:54:08 +01:00 |
| llama-quant.h | | |
| llama-sampling.cpp | Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639) | 2025-01-30 19:13:58 +00:00 |
| llama-sampling.h | | |
| llama-vocab.cpp | vocab : correctly identify LF token for GPT-2 style BPE tokenizer (#11496) | 2025-01-30 12:10:59 +02:00 |
| llama-vocab.h | | |
| llama.cpp | llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) | 2025-02-02 09:48:46 +02:00 |
| unicode-data.cpp | | |
| unicode-data.h | | |
| unicode.cpp | cmake : add sanitizer flags for llama.cpp (#11279) | 2025-01-18 16:18:15 +02:00 |
| unicode.h | | |