batched-bench
ggml : add Flash Attention ( #5021 )
2024-04-30 12:16:08 +03:00
convert-llama2c-to-ggml
TypoFix ( #7162 )
2024-05-09 10:16:45 +02:00
finetune
ggml : introduce bfloat16 support ( #6412 )
2024-05-08 09:30:09 +03:00
llama-bench
llama-bench : add pp+tg test type ( #7199 )
2024-05-10 18:03:54 +02:00
llava
llava-cli: fix base64 prompt ( #7248 )
2024-05-14 00:02:36 +10:00
lookup
Server: fix seed for multiple slots ( #6835 )
2024-04-24 11:08:36 +02:00
main
Fix memory bug in grammar parser ( #7194 )
2024-05-10 21:01:08 +10:00
perplexity
perplexity: add BF16 vs. FP16 results ( #7150 )
2024-05-13 13:03:27 +02:00
quantize
ggml : introduce bfloat16 support ( #6412 )
2024-05-08 09:30:09 +03:00
sycl
docs: fix typos ( #7124 )
2024-05-07 18:20:33 +03:00