llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

Julius Tischbein 2038101bd9 llama : add `use_direct_io` flag for model loading (#18166)	2026-01-08 08:35:30 +02:00
* Adding --direct-io flag for model loading
* Fixing read_raw() calls
* Fixing Windows read_raw_at
* Changing type off_t to size_t for windows and Renaming functions
* disable direct io when mmap is explicitly enabled
* Use read_raw_unsafe when upload_backend is available, not functional on some devices with Vulkan and SYCL
* Fallback to std::fread in case O_DIRECT fails due to bad address
* Windows: remove const keywords and unused functions
* Update src/llama-mmap.cpp
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: jtischbein <jtischbein@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
batched
batched.swift
convert-llama2c-to-ggml
debug	examples : add debug utility/example (#18464)	2026-01-07 10:42:19 +01:00
deprecation-warning
diffusion	llama : add `use_direct_io` flag for model loading (#18166)	2026-01-08 08:35:30 +02:00
embedding	model : add LFM2-ColBert-350M (#18607)	2026-01-05 19:52:56 +01:00
eval-callback
gen-docs
gguf
gguf-hash
idle
llama.android
llama.swiftui
lookahead
lookup
model-conversion	examples : add debug utility/example (#18464)	2026-01-07 10:42:19 +01:00
parallel
passkey
retrieval	model : add LFM2-ColBert-350M (#18607)	2026-01-05 19:52:56 +01:00
save-load-state
simple
simple-chat
simple-cmake-pkg
speculative
speculative-simple
sycl
training
CMakeLists.txt	examples : add debug utility/example (#18464)	2026-01-07 10:42:19 +01:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh