|
batched-bench
|
ggml : add Flash Attention (#5021)
|
2024-04-30 12:16:08 +03:00 |
|
convert-llama2c-to-ggml
|
TypoFix (#7162)
|
2024-05-09 10:16:45 +02:00 |
|
finetune
|
ggml : introduce bfloat16 support (#6412)
|
2024-05-08 09:30:09 +03:00 |
|
llama-bench
|
llama-bench : add pp+tg test type (#7199)
|
2024-05-10 18:03:54 +02:00 |
|
llama.android
|
move ndk code to a new library (#6951)
|
2024-05-14 17:30:30 +10:00 |
|
llava
|
llava-cli: fix base64 prompt (#7248)
|
2024-05-14 00:02:36 +10:00 |
|
lookup
|
Server: fix seed for multiple slots (#6835)
|
2024-04-24 11:08:36 +02:00 |
|
main
|
Fix memory bug in grammar parser (#7194)
|
2024-05-10 21:01:08 +10:00 |
|
perplexity
|
perplexity: add BF16 vs. FP16 results (#7150)
|
2024-05-13 13:03:27 +02:00 |
|
quantize
|
ggml : introduce bfloat16 support (#6412)
|
2024-05-08 09:30:09 +03:00 |
|
sycl
|
docs: fix typos (#7124)
|
2024-05-07 18:20:33 +03:00 |