|
batched-bench
|
ggml : add Flash Attention (#5021)
|
2024-04-30 12:16:08 +03:00 |
|
convert-llama2c-to-ggml
|
TypoFix (#7162)
|
2024-05-09 10:16:45 +02:00 |
|
finetune
|
ggml : introduce bfloat16 support (#6412)
|
2024-05-08 09:30:09 +03:00 |
|
llama-bench
|
llama-bench : add pp+tg test type (#7199)
|
2024-05-10 18:03:54 +02:00 |
|
lookup
|
Server: fix seed for multiple slots (#6835)
|
2024-04-24 11:08:36 +02:00 |
|
main
|
Fix memory bug in grammar parser (#7194)
|
2024-05-10 21:01:08 +10:00 |
|
rpc
|
rpc : set SO_REUSEADDR for the server socket (#7320)
|
2024-05-17 17:25:44 +03:00 |
|
server
|
server : add support for the RPC backend (#7305)
|
2024-05-17 10:00:17 +03:00 |
|
sycl
|
docs: fix typos (#7124)
|
2024-05-07 18:20:33 +03:00 |
|
CMakeLists.txt
|
ggml : add RPC backend (#6829)
|
2024-05-14 14:27:19 +03:00 |