llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Radoslav Gerganov 898acba681 rpc : add support for multiple devices (#16276 ) * rpc : add support for multiple devices Allow rpc-server to expose multiple devices from a single endpoint. Change RPC protocol to include device identifier where needed. closes: #15210 * fixes * use ggml_backend_reg_t * address review comments * fix llama-bench backend report * address review comments, change device naming * fix cmd order		2025-10-04 12:49:16 +03:00
..
arg.cpp	rpc : add support for multiple devices (#16276 )	2025-10-04 12:49:16 +03:00
arg.h	common : remove common_has_curl() (#16351 )	2025-09-30 17:39:44 +03:00
base64.hpp
build-info.cpp.in
chat-parser.cpp	model : Apertus model implementation (#15852 )	2025-10-02 20:43:22 +03:00
chat-parser.h	model : Apertus model implementation (#15852 )	2025-10-02 20:43:22 +03:00
chat.cpp	chat : support Magistral thinking (#16413 )	2025-10-03 21:51:48 +03:00
chat.h	chat : support Magistral thinking (#16413 )	2025-10-03 21:51:48 +03:00
CMakeLists.txt	common: introduce http.h for httplib-based client (#16373 )	2025-10-01 20:22:18 +03:00
common.cpp	devops: add s390x & ppc64le CI (#15925 )	2025-09-27 02:03:33 +08:00
common.h	server : context checkpointing for hybrid and recurrent models (#16382 )	2025-10-03 21:34:51 +03:00
console.cpp
console.h
http.h	common: introduce http.h for httplib-based client (#16373 )	2025-10-01 20:22:18 +03:00
json-partial.cpp
json-partial.h
json-schema-to-grammar.cpp	common : Fix corrupted memory error on json grammar initialization (#16038 )	2025-09-17 11:08:02 +03:00
json-schema-to-grammar.h
llguidance.cpp
log.cpp
log.h
ngram-cache.cpp
ngram-cache.h
regex-partial.cpp
regex-partial.h
sampling.cpp	llama: print memory breakdown on exit (#15860 )	2025-09-24 16:53:48 +02:00
sampling.h
speculative.cpp
speculative.h