llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Johannes Gäßler 10d2af0eaa llama/ggml: add LLM training support (#10544 ) * llama/ggml: add LLM training support more compact progress bar llama_save_model_to_file llama_opt_param_filter ggml_graph_dup force_grads refactor ggml_opt, fix test-opt * remove logits_all * refactor CUDA implementation for ACC * reset graph at beginning of opt period		2025-05-12 14:44:49 +02:00
..
CMakeLists.txt	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-adapter.cpp	llama : do not crash if there is no CPU backend (#13395 )	2025-05-09 13:02:07 +02:00
llama-adapter.h	llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )	2025-03-13 12:35:44 +02:00
llama-arch.cpp	model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466 )	2025-04-28 22:52:15 +03:00
llama-arch.h	model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466 )	2025-04-28 22:52:15 +03:00
llama-batch.cpp	kv-cache : separate recurrent vs non-recurrent impl (#12799 )	2025-05-02 17:48:36 +03:00
llama-batch.h	kv-cache : separate recurrent vs non-recurrent impl (#12799 )	2025-05-02 17:48:36 +03:00
llama-chat.cpp	llama : one-off chat template fix for Mistral-Small-2503 (#13398 )	2025-05-09 11:17:51 +02:00
llama-chat.h	llama : one-off chat template fix for Mistral-Small-2503 (#13398 )	2025-05-09 11:17:51 +02:00
llama-context.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-context.h	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-cparams.cpp
llama-cparams.h	Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386 )	2025-05-11 14:18:39 +02:00
llama-grammar.cpp	`tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars (#12034 )	2025-03-05 13:05:13 +00:00
llama-grammar.h	`tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars (#12034 )	2025-03-05 13:05:13 +00:00
llama-graph.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-graph.h	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-hparams.cpp	hparams : add SWA rope parameters (#12374 )	2025-03-14 09:03:24 +02:00
llama-hparams.h	model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architecture (#12466 )	2025-04-28 22:52:15 +03:00
llama-impl.cpp
llama-impl.h
llama-io.cpp	llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )	2025-03-13 12:35:44 +02:00
llama-io.h	llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )	2025-03-13 12:35:44 +02:00
llama-kv-cache.cpp	kv-cache : separate recurrent vs non-recurrent impl (#12799 )	2025-05-02 17:48:36 +03:00
llama-kv-cache.h	kv-cache : separate recurrent vs non-recurrent impl (#12799 )	2025-05-02 17:48:36 +03:00
llama-memory.cpp	llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )	2025-03-13 12:35:44 +02:00
llama-memory.h	kv-cache : separate recurrent vs non-recurrent impl (#12799 )	2025-05-02 17:48:36 +03:00
llama-mmap.cpp	mmap : skip resource limit checks on AIX (#12541 )	2025-03-24 12:17:10 +02:00
llama-mmap.h
llama-model-loader.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-model-loader.h	llama : add option to override model tensor buffers (#11397 )	2025-04-02 14:52:01 +02:00
llama-model-saver.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-model-saver.h	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-model.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-model.h	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-quant.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-quant.h
llama-sampling.cpp	sampling : make top_n_sigma no-op at <=0 or a single candidate (#13345 )	2025-05-06 22:36:24 +02:00
llama-sampling.h
llama-vocab.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama-vocab.h	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
llama.cpp	llama/ggml: add LLM training support (#10544 )	2025-05-12 14:44:49 +02:00
unicode-data.cpp
unicode-data.h
unicode.cpp
unicode.h