This website requires JavaScript.
Explore
Help
Sign In
pingu_98
/
llama_cpp_for_radxa_dragon_wing_q6a
Watch
1
Star
0
Fork
You've already forked llama_cpp_for_radxa_dragon_wing_q6a
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
Actions
48bd26501b
llama_cpp_for_radxa_dragon_...
/
ggml
History
theo77186
622cd010ff
ggml: CUDA: add head size 72 for flash-attn (
#16962
)
2025-11-03 14:29:11 +01:00
..
cmake
ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (
#15094
)
2025-08-07 13:45:41 +02:00
include
model: add support for qwen3vl series (
#16780
)
2025-10-30 16:19:14 +01:00
src
ggml: CUDA: add head size 72 for flash-attn (
#16962
)
2025-11-03 14:29:11 +01:00
.gitignore
CMakeLists.txt
Add experimental ggml-hexagon backend for the Hexagon NPU (
#16547
)
2025-10-22 13:47:09 -07:00