llama_cpp_for_radxa_dragon_wing_q6a

History

Gaurav Garg fd6ae4ca1c Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE (#22129 ) * Fix delayed AllReduce on Gemma-4 MoE Skip forward past nodes that don't consume the current one, and allow a chain of MULs. * Check for all sources before skipping nodes * Address review comments		2026-04-20 18:25:39 +02:00
..
cmake
include
src	Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE (#22129 )	2026-04-20 18:25:39 +02:00
.gitignore
CMakeLists.txt