llama_cpp_for_radxa_dragon_.../docs
Chenguang Li 2f853687b3
CANN: Support eager execution mode under ACL graph compilation (#15712)
* [CANN] Support eager execution mode under ACL graph compilation

Add support for running operators in eager mode while ACL graph
compilation is enabled. This allows bypassing graph execution
and directly submitting ops, which is useful for debugging and
reducing graph build overhead in certain scenarios.

Signed-off-by: noemotiovon <757486878@qq.com>

* fix typo

Signed-off-by: noemotiovon <757486878@qq.com>

* rename to acl_graph_mode

Signed-off-by: noemotiovon <757486878@qq.com>

---------

Signed-off-by: noemotiovon <757486878@qq.com>
2025-09-02 14:07:48 +08:00
..
backend CANN: Support eager execution mode under ACL graph compilation (#15712) 2025-09-02 14:07:48 +08:00
development
multimodal model : support MiniCPM-V 4.5 (#15575) 2025-08-26 10:05:55 +02:00
ops ggml: initial IBM zDNN backend (#14975) 2025-08-15 21:11:22 +08:00
android.md
build-s390x.md ggml-cpu: Support Q5_0 and Q5_1 on s390x (#15486) 2025-08-22 16:11:04 +08:00
build.md Update build.md to remove MSVC arm64 notes (#15684) 2025-08-30 23:51:28 +08:00
docker.md
function-calling.md server : add documentation for parallel_tool_calls param (#15647) 2025-08-29 20:25:40 +03:00
install.md
llguidance.md
multimodal.md
ops.md ggml: initial IBM zDNN backend (#14975) 2025-08-15 21:11:22 +08:00