llama_cpp_for_radxa_dragon_wing_q6a

Giuseppe Scrivano 1eeb523c3e vulkan: optimize UMA buffer operations and fix driver hangs (#16059)

* vulkan: optimize UMA buffer operations and fix driver hangs

The previous implementation was blocking the GPU for extended periods, causing the i915 driver to reset the context due to the hangcheck protection.

[32628.443070] i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:85dffffb, in llama-server [194114]
[32628.443091] i915 0000:00:02.0: [drm] llama-server[194114] context reset due to GPU hang

* vulkan: implement deferred_memset on UMA

---------

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

2025-09-21 08:31:55 +02:00
cmake
include
src	vulkan: optimize UMA buffer operations and fix driver hangs (#16059)	2025-09-21 08:31:55 +02:00
.gitignore
CMakeLists.txt	ggml : introduce semantic versioning (ggml/1336)	2025-09-20 13:02:14 +03:00