llama_cpp_for_radxa_dragon_.../ggml
Jeff Bolz 1384abf8b8
vulkan: handle mat_mul with A matrix > 4GB (#16176)
* vulkan: handle mat_mul with A matrix > 4GB

This change splits mat_mul operations with huge A matrix into chunks in the M
dimension. This works well for stable-diffusion use cases where the im2col
matrix has very large M.

Fix the order of setting the stride in mul_mm_cm2 - setting the dimension
clobbers the stride, so stride should be set after.

* build fixes
2025-09-27 20:36:34 -05:00
..
cmake
include
src vulkan: handle mat_mul with A matrix > 4GB (#16176) 2025-09-27 20:36:34 -05:00
.gitignore
CMakeLists.txt common : use cpp-httplib as a cURL alternative for downloads (#16185) 2025-09-26 14:12:19 +03:00