| .. | | |
| models | models : deduplicate delta-net graphs for Qwen family (#19597) | 2026-02-16 14:35:04 +02:00 |
| CMakeLists.txt | models : deduplicate delta-net graphs for Qwen family (#19597) | 2026-02-16 14:35:04 +02:00 |
| llama-adapter.cpp | | |
| llama-adapter.h | graph : fix KQ mask, lora, cvec reuse checks (#19644) | 2026-02-16 09:21:11 +02:00 |
| llama-arch.cpp | model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460) | 2026-02-13 14:56:53 +01:00 |
| llama-arch.h | model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460) | 2026-02-13 14:56:53 +01:00 |
| llama-batch.cpp | | |
| llama-batch.h | | |
| llama-chat.cpp | docs : Minor cleanups (#19252) | 2026-02-02 08:38:55 +02:00 |
| llama-chat.h | | |
| llama-context.cpp | graph : fix KQ mask, lora, cvec reuse checks (#19644) | 2026-02-16 09:21:11 +02:00 |
| llama-context.h | graph : fix KQ mask, lora, cvec reuse checks (#19644) | 2026-02-16 09:21:11 +02:00 |
| llama-cparams.cpp | | |
| llama-cparams.h | | |
| llama-grammar.cpp | llama : rename llama-sampling to llama-sampler (#19363) | 2026-02-06 07:26:54 +01:00 |
| llama-grammar.h | | |
| llama-graph.cpp | graph : fix KQ mask, lora, cvec reuse checks (#19644) | 2026-02-16 09:21:11 +02:00 |
| llama-graph.h | Kimi-Linear support (backend agnostic + MLA KV cache) (#18755) | 2026-02-06 11:39:58 +01:00 |
| llama-hparams.cpp | Kimi-Linear support (backend agnostic + MLA KV cache) (#18755) | 2026-02-06 11:39:58 +01:00 |
| llama-hparams.h | model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460) | 2026-02-13 14:56:53 +01:00 |
| llama-impl.cpp | | |
| llama-impl.h | llama : refactor sampling_info to use buffer_view template (#19368) | 2026-02-11 05:38:13 +01:00 |
| llama-io.cpp | | |
| llama-io.h | | |
| llama-kv-cache-iswa.cpp | model : support Step3.5-Flash (#19283) | 2026-02-06 21:06:14 +01:00 |
| llama-kv-cache-iswa.h | | |
| llama-kv-cache.cpp | model : support Step3.5-Flash (#19283) | 2026-02-06 21:06:14 +01:00 |
| llama-kv-cache.h | | |
| llama-kv-cells.h | | |
| llama-memory-hybrid-iswa.cpp | memory : add llama_memory_hybrid_iswa (#18601) | 2026-01-21 14:30:23 +02:00 |
| llama-memory-hybrid-iswa.h | memory : add llama_memory_hybrid_iswa (#18601) | 2026-01-21 14:30:23 +02:00 |
| llama-memory-hybrid.cpp | | |
| llama-memory-hybrid.h | | |
| llama-memory-recurrent.cpp | memory : clarify comments for r_l and s_l tensors [no ci] (#19203) | 2026-01-30 15:18:41 +01:00 |
| llama-memory-recurrent.h | | |
| llama-memory.cpp | | |
| llama-memory.h | | |
| llama-mmap.cpp | mmap: Fix Windows handle lifetime (#19598) | 2026-02-14 10:05:12 +02:00 |
| llama-mmap.h | | |
| llama-model-loader.cpp | llama : disable Direct IO by default (#19109) | 2026-01-28 09:11:13 +02:00 |
| llama-model-loader.h | | |
| llama-model-saver.cpp | kv-cache : support V-less cache (#19067) | 2026-01-25 15:48:56 +02:00 |
| llama-model-saver.h | | |
| llama-model.cpp | model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460) | 2026-02-13 14:56:53 +01:00 |
| llama-model.h | model: support GLM MoE DSA arch (NOTE: indexer is not yet supported) (#19460) | 2026-02-13 14:56:53 +01:00 |
| llama-quant.cpp | Kimi-Linear support (backend agnostic + MLA KV cache) (#18755) | 2026-02-06 11:39:58 +01:00 |
| llama-quant.h | | |
| llama-sampler.cpp | llama : rename llama-sampling to llama-sampler (#19363) | 2026-02-06 07:26:54 +01:00 |
| llama-sampler.h | llama : rename llama-sampling to llama-sampler (#19363) | 2026-02-06 07:26:54 +01:00 |
| llama-vocab.cpp | convert : add JoyAI-LLM-Flash (#19651) | 2026-02-16 22:49:57 +01:00 |
| llama-vocab.h | convert : add JoyAI-LLM-Flash (#19651) | 2026-02-16 22:49:57 +01:00 |
| llama.cpp | llama: fix integer type consistency in split helpers (#18894) | 2026-01-25 09:10:52 +02:00 |
| unicode-data.cpp | | |
| unicode-data.h | | |
| unicode.cpp | model: Add support for Tiny Aya Models (#19611) | 2026-02-16 16:28:46 +01:00 |
| unicode.h | | |