llama_cpp_for_radxa_dragon_.../src
Daniel Bevenius ca0ef2dddb
llama : clarify comment about pp and tg graphs [no ci] (#14895)
* llama : clarify comment about pp and tg graphs [no ci]

This commit clarifies the comment in `llama-context.cpp` regarding the
prefill prompt (pp), and token generation (tg) graphs.

The motivation for this is that I've struggled to remember these and had
to look them up more than once, so I thought it would be helpful to add
a comment that makes it clear what these stand for.

* squash! llama : clarify comment about pp and tg graphs [no ci]

Change "pp" to "prompt processing".
2025-07-27 12:10:51 +02:00
..
CMakeLists.txt
llama-adapter.cpp
llama-adapter.h
llama-arch.cpp chat : fix kimi-k2 chat template (#14852) 2025-07-24 13:59:56 +02:00
llama-arch.h model : add EXAONE 4.0 support (#14630) 2025-07-18 10:45:49 +02:00
llama-batch.cpp llama : reuse compute graphs (#14482) 2025-07-17 19:08:33 +03:00
llama-batch.h llama : reuse compute graphs (#14482) 2025-07-17 19:08:33 +03:00
llama-chat.cpp chat : fix kimi-k2 chat template (#14852) 2025-07-24 13:59:56 +02:00
llama-chat.h model : add EXAONE 4.0 support (#14630) 2025-07-18 10:45:49 +02:00
llama-context.cpp llama : clarify comment about pp and tg graphs [no ci] (#14895) 2025-07-27 12:10:51 +02:00
llama-context.h context : restore preemptive sched reset when LLAMA_SET_ROWS=0 (#14870) 2025-07-25 14:28:06 +03:00
llama-cparams.cpp
llama-cparams.h llama : add high-throughput mode (#14363) 2025-07-16 16:35:42 +03:00
llama-grammar.cpp
llama-grammar.h
llama-graph.cpp metal : fuse add, mul + add tests (#14596) 2025-07-18 20:37:26 +03:00
llama-graph.h graph : refactor context to not pass gf explicitly (#14629) 2025-07-18 08:29:28 +03:00
llama-hparams.cpp llama : add high-throughput mode (#14363) 2025-07-16 16:35:42 +03:00
llama-hparams.h model : make rope_yarn_log_mul optional for deepseek2 (#14896) 2025-07-27 11:18:37 +03:00
llama-impl.cpp
llama-impl.h
llama-io.cpp
llama-io.h
llama-kv-cache-unified-iswa.cpp llama : add high-throughput mode (#14363) 2025-07-16 16:35:42 +03:00
llama-kv-cache-unified-iswa.h llama : add high-throughput mode (#14363) 2025-07-16 16:35:42 +03:00
llama-kv-cache-unified.cpp kv-cache : fix k-shift for multiple streams (#14742) 2025-07-17 20:52:33 +03:00
llama-kv-cache-unified.h llama : reuse compute graphs (#14482) 2025-07-17 19:08:33 +03:00
llama-kv-cells.h kv-cache : use ggml_set_rows (#14285) 2025-07-03 10:53:35 +03:00
llama-memory-hybrid.cpp llama : fix parameter order for hybrid memory initialization (#14725) 2025-07-16 21:17:25 +02:00
llama-memory-hybrid.h kv-cache : use ggml_set_rows (#14285) 2025-07-03 10:53:35 +03:00
llama-memory-recurrent.cpp memory : handle saving/loading null layers in recurrent memory (#14675) 2025-07-23 11:16:41 +03:00
llama-memory-recurrent.h
llama-memory.cpp memory : correctly handle failure in apply() (#14438) 2025-06-30 18:03:03 +03:00
llama-memory.h memory : correctly handle failure in apply() (#14438) 2025-06-30 18:03:03 +03:00
llama-mmap.cpp
llama-mmap.h
llama-model-loader.cpp
llama-model-loader.h
llama-model-saver.cpp
llama-model-saver.h
llama-model.cpp model : make rope_yarn_log_mul optional for deepseek2 (#14896) 2025-07-27 11:18:37 +03:00
llama-model.h model: add Ernie 4.5 MoE support (#14658) 2025-07-17 23:15:32 +02:00
llama-quant.cpp quantize : fix minor logic flaw in --tensor-type (#14572) 2025-07-13 18:02:17 +02:00
llama-quant.h
llama-sampling.cpp
llama-sampling.h
llama-vocab.cpp model : add EXAONE 4.0 support (#14630) 2025-07-18 10:45:49 +02:00
llama-vocab.h Support diffusion models: Add Dream 7B (#14644) 2025-07-16 20:03:51 +08:00
llama.cpp
unicode-data.cpp
unicode-data.h
unicode.cpp model : add Kimi-K2 support (#14654) 2025-07-15 21:54:22 +02:00
unicode.h model : add Kimi-K2 support (#14654) 2025-07-15 21:54:22 +02:00