* llama : add inference support and model types for T5 and FLAN-T5 model families
* llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token()
* common, llama-cli, llama-batched : add support for encoder-decoder models
* convert-hf : handle shared token embeddings tensors in T5Model
* convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models)
* convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model
* convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
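The new encoder-decoder entry points named above can be sketched roughly as below. This is an illustrative outline only, based on the function names listed in this commit message; the exact signatures, return conventions, and the batch-building helpers are assumptions and should be checked against the actual `llama.h` of the matching revision.

```cpp
// Sketch: running a T5-style encoder-decoder model with the new API.
// Assumes the signatures implied by the commit message; not verified
// against a specific llama.h revision.
#include "llama.h"

void run_encoder_decoder(llama_context * ctx,
                         const llama_model * model,
                         llama_batch & prompt_batch) {
    if (llama_model_has_encoder(model)) {
        // 1. Run the encoder once over the full input prompt.
        llama_encode(ctx, prompt_batch);

        // 2. Seed the decoder with the model's decoder start token.
        //    The fallback to BOS when no start token is defined is an
        //    assumption about how a caller might handle a missing value.
        llama_token dec_start = llama_model_decoder_start_token(model);
        if (dec_start == -1) {
            dec_start = llama_token_bos(model); // hypothetical fallback
        }

        // 3. ... build a fresh batch containing dec_start and run the
        //    usual autoregressive llama_decode() sampling loop ...
    } else {
        // Decoder-only models keep the existing llama_decode() path.
    }
}
```

For decoder-only architectures, `llama_model_has_encoder()` returns false and callers fall through to the pre-existing decode path, which is why the examples (llama-cli, llama-batched) could adopt the check without branching their main loops.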
Directory listing:

* baby-llama
* batched
* batched-bench
* batched.swift
* benchmark
* convert-llama2c-to-ggml
* cvector-generator
* embedding
* eval-callback
* export-lora
* finetune
* gbnf-validator
* gguf
* gguf-split
* gritlm
* imatrix
* infill
* jeopardy
* llama-bench
* llama.android
* llama.swiftui
* llava
* lookahead
* lookup
* main
* main-cmake-pkg
* parallel
* passkey
* perplexity
* quantize
* quantize-stats
* retrieval
* rpc
* save-load-state
* server
* simple
* speculative
* sycl
* tokenize
* train-text-from-scratch
* base-translate.sh
* chat-13B.bat
* chat-13B.sh
* chat-persistent.sh
* chat-vicuna.sh
* chat.sh
* CMakeLists.txt
* convert-legacy-llama.py
* json-schema-pydantic-example.py
* json_schema_to_grammar.py
* llama.vim
* llm.vim
* Miku.sh
* pydantic-models-to-grammar-examples.py
* pydantic_models_to_grammar.py
* reason-act.sh
* regex-to-grammar.py
* server-embd.py
* server-llama2-13B.sh
* ts-type-to-grammar.sh