llama_cpp_for_radxa_dragon_.../ggml
Georgi Gerganov 0a319bb75e
metal : add support for non-padded FA KV (#16148)
* metal : pad K, V and Mask when needed

* cont : simplify

* cuda : add TODO about KV padding requirement

* metal : add comments

* metal : remove mask padding requirement
2025-10-07 08:23:30 +03:00
cmake
include    rpc : add support for multiple devices (#16276)      2025-10-04 12:49:16 +03:00
src        metal : add support for non-padded FA KV (#16148)    2025-10-07 08:23:30 +03:00
.gitignore
CMakeLists.txt