llama_cpp_for_radxa_dragon_wing_q6a

History

Pierrick Hymbert 4bd0f93e4a model: support arch `DbrxForCausalLM` (#6515 ) * model: dbrx convert to gguf #6344 * llama: support dbrx #6344 * doc: dbrx: add the model as supported * scripts: get-wikitext-2 add unzip * llama: increase maximum experts allowed * llama: factorize moe graph implementation between grok, mixtral and dbrx --------- Co-authored-by: Megha Agarwal <16129366+megha95@users.noreply.github.com>		2024-04-13 11:33:52 +02:00
..
build-info.cmake
build-info.sh
check-requirements.sh
ci-run.sh
compare-commits.sh
compare-llama-bench.py	compare-llama-bench.py: fix long hexsha args (#6424 )	2024-04-01 13:30:43 +02:00
convert-gg.sh
gen-authors.sh	license : update copyright notice + add AUTHORS (#6405 )	2024-04-09 09:23:19 +03:00
gen-build-info-cpp.cmake
get-flags.mk
get-hellaswag.sh
get-pg.sh
get-wikitext-2.sh	model: support arch `DbrxForCausalLM` (#6515 )	2024-04-13 11:33:52 +02:00
get-wikitext-103.sh
get-winogrande.sh
hf.sh	scripts : add --outdir option to hf.sh (#6600 )	2024-04-11 16:22:47 +03:00
install-oneapi.bat
LlamaConfig.cmake.in
pod-llama.sh
qnt-all.sh
run-all-perf.sh
run-all-ppl.sh
run-with-preset.py
server-llm.sh
sync-ggml-am.sh	license : update copyright notice + add AUTHORS (#6405 )	2024-04-09 09:23:19 +03:00
sync-ggml.last	sync : ggml	2024-04-09 20:29:06 +03:00
sync-ggml.sh	license : update copyright notice + add AUTHORS (#6405 )	2024-04-09 09:23:19 +03:00
verify-checksum-models.py