llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Daniel Bevenius 8e3ead6e4d model-conversion : add device option to run-org-model.py (#18318 ) * model-conversion : add device option to run-org-model.py This commit refactors the `run-org-model.py` script to include a `--device` argument, to allow users to specify the device on which to run the model (e.g., cpu, cuda, mps, auto). It also extracts a few common functions to prepare for future changes where some code duplication will be removed which there currently exists in embedding scripts. The Makefile is also been updated to pass the device argument, for example: ```console (venv) $ make causal-verify-logits DEVICE=cpu ``` * fix error handling and remove parser reference This commit fixes the error handling which previously referenced an undefined 'parser' variable.		2025-12-23 14:07:25 +01:00
..
batched
batched.swift
convert-llama2c-to-ggml
deprecation-warning
diffusion
embedding
eval-callback
gen-docs	gen-docs: automatically update markdown file (#18294 )	2025-12-22 19:30:19 +01:00
gguf
gguf-hash
idle
llama.android	android: fix missing screenshots for Android.md (#18156 )	2025-12-19 09:32:04 +02:00
llama.swiftui
lookahead
lookup
model-conversion	model-conversion : add device option to run-org-model.py (#18318 )	2025-12-23 14:07:25 +01:00
parallel
passkey
retrieval
save-load-state
simple
simple-chat
simple-cmake-pkg
speculative	common : restore grammar-based rejection sampling (#18137 )	2025-12-17 19:46:00 +02:00
speculative-simple
sycl	[SYCL] replace llama-cli by llama-completion to rm the impact to test script (#18290 )	2025-12-23 12:59:12 +08:00
training
CMakeLists.txt
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py
llama.vim
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh