llama_cpp_for_radxa_dragon_.../ggml
neha-ha a6cc43c286
ggml-webgpu: updated matrix-vector multiplication (#21738)
* merged properly, but slow q3_k and q5_k with u32 indexing

* Start on new mat-vec

* New format float paths working

* Working q4_0

* Work on remaining legacy q-types

* port k-quants to new matvec

* remove old shader

* Remove old constants, format

* remove accidental file

---------

Co-authored-by: Neha Abbas <nehaabbas@ReeseLevines-MacBook-Pro.local>
Co-authored-by: Reese Levine <reeselevine1@gmail.com>
2026-04-20 07:37:17 -07:00
cmake
include
src | ggml-webgpu: updated matrix-vector multiplication (#21738) | 2026-04-20 07:37:17 -07:00
.gitignore
CMakeLists.txt | cmake: remove CMP0194 policy to restore MSVC builds (#21934) | 2026-04-19 10:25:05 +03:00