llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

Radoslav Gerganov	8d94219a4a	ggml : add ggml_set_rows (#14274)	2025-06-27 16:41:40 +03:00

* ggml : add ggml_set_rows
  Add ggml_set_rows(a, b, c) which copies rows from 'b' into 'a' using indices from 'c'.
  ref: #8366
* use I64 for indices
* ggml : add repeat impl for i64
* ggml : add ggml_is_contiguous_rows
* ggml : ggml_set_rows support broadcast
* ggml : ggml_set_rows support quantized dst ggml-ci
* ggml : support GGML_TYPE_F32 ".from_float" trait
* ggml : ggml_set_rows update comment + better index name
* tests : add ggml_set_rows
* metal : add ggml_set_rows implementation ggml-ci
* ggml : simplify forward_dup_f32
* ggml : fix supports_op
* tests : add comment to set_rows
* ggml : leave the repeat_i64 for a separate PR ggml-ci
* ggml : set_rows use std::min instead of MIN
* ggml : better error message for set_rows unsupported type
* metal : perform op->type check only once
* tests : more consistent implementation + more tests ggml-ci

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

batched
batched.swift
convert-llama2c-to-ggml
deprecation-warning
embedding	llama : improve sep token handling (#14272)	2025-06-20 14:04:09 +02:00
eval-callback	ggml : add ggml_set_rows (#14274)	2025-06-27 16:41:40 +03:00
gen-docs
gguf
gguf-hash
gritlm
jeopardy
llama.android
llama.swiftui
lookahead
lookup
parallel
passkey
retrieval
save-load-state
simple
simple-chat	examples : fix is_first logic for tokenization (#14329)	2025-06-22 20:10:07 +03:00
simple-cmake-pkg
speculative
speculative-simple
sycl
training
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
llm.vim
Miku.sh
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh