..
baby-llama
baby-llama : allocate graphs in ggml_context ( #5573 )
2024-02-19 10:25:38 +02:00
batched
ggml, common, examples, tests : fixed type arguments in printf ( #5528 )
2024-02-18 18:20:12 +02:00
batched-bench
ggml, common, examples, tests : fixed type arguments in printf ( #5528 )
2024-02-18 18:20:12 +02:00
batched.swift
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
beam-search
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
benchmark
convert-llama2c-to-ggml
ggml, common, examples, tests : fixed type arguments in printf ( #5528 )
2024-02-18 18:20:12 +02:00
embedding
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
export-lora
ci : add an option to fail on compile warning ( #3952 )
2024-02-17 23:03:14 +02:00
finetune
finetune : rename feed-forward tensors (w1/w2/w3) ( #4839 )
2024-02-13 15:15:42 +02:00
gguf
imatrix
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
infill
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
jeopardy
llama-bench
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
llama.android
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
llama.swiftui
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
llava
llava : add --skip-unknown to 1.6 convert.py ( #5632 )
2024-02-21 15:36:57 +02:00
lookahead
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
lookup
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
main
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
main-cmake-pkg
parallel
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
passkey
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
perplexity
ci : fix wikitext url + compile warnings ( #5569 )
2024-02-18 22:39:30 +02:00
quantize
IQ4_NL: 4-bit non-linear quants with blocks of 32 ( #5590 )
2024-02-21 11:39:52 +02:00
quantize-stats
refactor : switch to emplace_back to avoid extra object ( #5291 )
2024-02-03 13:23:37 +02:00
save-load-state
server
server: health: fix race condition on slots data using tasks queue ( #5634 )
2024-02-21 15:47:48 +01:00
simple
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
speculative
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
sycl
[SYCL] update guide of SYCL backend ( #5254 )
2024-02-02 15:53:27 +08:00
tokenize
ggml : add numa options ( #5377 )
2024-02-16 11:31:07 +02:00
train-text-from-scratch
ggml, common, examples, tests : fixed type arguments in printf ( #5528 )
2024-02-18 18:20:12 +02:00
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gguf : add python reader example ( #5216 )
2024-02-13 19:56:38 +02:00
gpt4all.sh
json-schema-to-grammar.py
examples : support minItems/maxItems in JSON grammar converter ( #5039 )
2024-02-19 16:14:07 +02:00
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
make-ggml.py
Miku.sh
pydantic-models-to-grammar-examples.py
pydantic_models_to_grammar.py
reason-act.sh
server-llama2-13B.sh