| .. |
|
arg.cpp
|
presets : fix pooling param for embedding models (#16455)
|
2025-10-07 10:32:32 +03:00 |
|
arg.h
|
common : remove common_has_curl() (#16351)
|
2025-09-30 17:39:44 +03:00 |
|
base64.hpp
|
|
|
|
build-info.cpp.in
|
|
|
|
chat-parser.cpp
|
model : Apertus model implementation (#15852)
|
2025-10-02 20:43:22 +03:00 |
|
chat-parser.h
|
model : Apertus model implementation (#15852)
|
2025-10-02 20:43:22 +03:00 |
|
chat.cpp
|
chat : support Magistral thinking (#16413)
|
2025-10-03 21:51:48 +03:00 |
|
chat.h
|
chat : support Magistral thinking (#16413)
|
2025-10-03 21:51:48 +03:00 |
|
CMakeLists.txt
|
common: introduce http.h for httplib-based client (#16373)
|
2025-10-01 20:22:18 +03:00 |
|
common.cpp
|
llama : add --no-host to disable host buffers (#16310)
|
2025-10-06 19:55:53 +02:00 |
|
common.h
|
llama : add --no-host to disable host buffers (#16310)
|
2025-10-06 19:55:53 +02:00 |
|
console.cpp
|
|
|
|
console.h
|
|
|
|
http.h
|
common: introduce http.h for httplib-based client (#16373)
|
2025-10-01 20:22:18 +03:00 |
|
json-partial.cpp
|
|
|
|
json-partial.h
|
|
|
|
json-schema-to-grammar.cpp
|
common : Fix corrupted memory error on json grammar initialization (#16038)
|
2025-09-17 11:08:02 +03:00 |
|
json-schema-to-grammar.h
|
|
|
|
llguidance.cpp
|
|
|
|
log.cpp
|
Implement --log-colors with always/never/auto (#15792)
|
2025-09-05 19:43:59 +01:00 |
|
log.h
|
Implement --log-colors with always/never/auto (#15792)
|
2025-09-05 19:43:59 +01:00 |
|
ngram-cache.cpp
|
|
|
|
ngram-cache.h
|
|
|
|
regex-partial.cpp
|
|
|
|
regex-partial.h
|
|
|
|
sampling.cpp
|
llama: print memory breakdown on exit (#15860)
|
2025-09-24 16:53:48 +02:00 |
|
sampling.h
|
sampling : optimize samplers by reusing bucket sort (#15665)
|
2025-08-31 20:41:02 +03:00 |
|
speculative.cpp
|
sampling : optimize samplers by reusing bucket sort (#15665)
|
2025-08-31 20:41:02 +03:00 |
|
speculative.h
|
server : implement universal assisted decoding (#12635)
|
2025-07-31 14:25:23 +02:00 |