llama_cpp_for_radxa_dragon_wing_q6a

Wallentri f2c0dfb739 Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type (#19959)	2026-03-14 15:43:13 +08:00
* Update ggml-cuda.cu
* Update ggml-cuda.cu
* Update build.md
* Update build.md
* Update ggml/src/ggml-cuda/ggml-cuda.cu
  Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
* Update ggml-cuda.cu
* Update build.md
* Update ggml/src/ggml-cuda/ggml-cuda.cu
  Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
* Update build.md
* Update ggml-cuda.cu
* Update ggml-cuda.cu
---------
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
..
cmake
include	ggml : add OpenVINO backend (#15307)	2026-03-14 07:56:55 +02:00
src	Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type (#19959)	2026-03-14 15:43:13 +08:00
.gitignore
CMakeLists.txt	ggml : add OpenVINO backend (#15307)	2026-03-14 07:56:55 +02:00