llama_cpp_for_radxa_dragon_wing_q6a

pingu_98/llama_cpp_for_radxa_dragon_wing_q6a

History

Georgi Gerganov ef4c5b87ea presets : fix pooling param for embedding models (#16455 )		2025-10-07 10:32:32 +03:00
..
arg.cpp	presets : fix pooling param for embedding models (#16455 )	2025-10-07 10:32:32 +03:00
arg.h	common : remove common_has_curl() (#16351 )	2025-09-30 17:39:44 +03:00
base64.hpp
build-info.cpp.in
chat-parser.cpp	model : Apertus model implementation (#15852 )	2025-10-02 20:43:22 +03:00
chat-parser.h	model : Apertus model implementation (#15852 )	2025-10-02 20:43:22 +03:00
chat.cpp	chat : support Magistral thinking (#16413 )	2025-10-03 21:51:48 +03:00
chat.h	chat : support Magistral thinking (#16413 )	2025-10-03 21:51:48 +03:00
CMakeLists.txt	common: introduce http.h for httplib-based client (#16373 )	2025-10-01 20:22:18 +03:00
common.cpp	llama : add --no-host to disable host buffers (#16310 )	2025-10-06 19:55:53 +02:00
common.h	llama : add --no-host to disable host buffers (#16310 )	2025-10-06 19:55:53 +02:00
console.cpp
console.h
http.h	common: introduce http.h for httplib-based client (#16373 )	2025-10-01 20:22:18 +03:00
json-partial.cpp
json-partial.h
json-schema-to-grammar.cpp	common : Fix corrupted memory error on json grammar initialization (#16038 )	2025-09-17 11:08:02 +03:00
json-schema-to-grammar.h
llguidance.cpp
log.cpp	Implement --log-colors with always/never/auto (#15792 )	2025-09-05 19:43:59 +01:00
log.h	Implement --log-colors with always/never/auto (#15792 )	2025-09-05 19:43:59 +01:00
ngram-cache.cpp
ngram-cache.h
regex-partial.cpp
regex-partial.h
sampling.cpp	llama: print memory breakdown on exit (#15860 )	2025-09-24 16:53:48 +02:00
sampling.h	sampling : optimize samplers by reusing bucket sort (#15665 )	2025-08-31 20:41:02 +03:00
speculative.cpp	sampling : optimize samplers by reusing bucket sort (#15665 )	2025-08-31 20:41:02 +03:00
speculative.h	server : implement universal assisted decoding (#12635 )	2025-07-31 14:25:23 +02:00