llama_cpp_for_radxa_dragon_.../docs
Neo Zhang eddd7a13a5
[SYCL] Optimize Q4_0 mul_mat for Arc770, add scripts (#22291)
* opt arc770 for Q4_0

* add for Q4_0

* update the script

* add help script for windows

* update guide

* fix format issue

* convert from dos to unix for format issue

* fix missed -sm parameter
2026-04-25 09:20:14 +03:00
android
backend [SYCL] Optimize Q4_0 mul_mat for Arc770, add scripts (#22291) 2026-04-25 09:20:14 +03:00
development docs: more extensive RoPE documentation [no ci] (#21953) 2026-04-15 14:45:16 +02:00
multimodal chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
ops ggml-webgpu: support for SSM_SCAN and disable set_rows error checking (#22327) 2026-04-25 09:18:15 +03:00
android.md
autoparser.md common/parser: add proper reasoning tag prefill reading (#20424) 2026-03-19 16:58:21 +01:00
build-riscv64-spacemit.md
build-s390x.md
build.md CUDA: require explicit opt-in for P2P access (#21910) 2026-04-15 16:01:46 +02:00
docker.md CI : Enable CUDA and Vulkan ARM64 runners and fix CI/CD (#21122) 2026-03-30 20:24:37 +02:00
function-calling.md
install.md
llguidance.md
multimodal.md docs: listing qwen3-asr and qwen3-omni as supported (#21857) 2026-04-13 22:28:17 +02:00
ops.md ggml-webgpu: support for SSM_SCAN and disable set_rows error checking (#22327) 2026-04-25 09:18:15 +03:00
preset.md
speculative.md