llama_cpp_for_radxa_dragon_wing_q6a

Oliver Simons e06088da0f 2026-02-08 15:12:51 +02:00
CUDA: Fix non-contig rope (#19338)

* Rename variables + fix rope_neox
  Seems memory layout is shared with Vulkan so we can port fix from https://github.com/ggml-org/llama.cpp/pull/19299
* Fix rope_multi
* Fix rope_vision
* Fix rope_norm
* Rename ne* to ne0* for consistent variable naming
* cont : consistent stride names

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
cmake
include
src	CUDA: Fix non-contig rope (#19338)	2026-02-08 15:12:51 +02:00
.gitignore
CMakeLists.txt