* model: dbrx convert to gguf #6344 * llama: support dbrx #6344 * doc: dbrx: add the model as supported * scripts: get-wikitext-2 add unzip * llama: increase maximum experts allowed * llama: factorize moe graph implementation between grok, mixtral and dbrx --------- Co-authored-by: Megha Agarwal <16129366+megha95@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| build-info.cmake | ||
| build-info.sh | ||
| check-requirements.sh | ||
| ci-run.sh | ||
| compare-commits.sh | ||
| compare-llama-bench.py | ||
| convert-gg.sh | ||
| gen-authors.sh | ||
| gen-build-info-cpp.cmake | ||
| get-flags.mk | ||
| get-hellaswag.sh | ||
| get-pg.sh | ||
| get-wikitext-2.sh | ||
| get-wikitext-103.sh | ||
| get-winogrande.sh | ||
| hf.sh | ||
| install-oneapi.bat | ||
| LlamaConfig.cmake.in | ||
| pod-llama.sh | ||
| qnt-all.sh | ||
| run-all-perf.sh | ||
| run-all-ppl.sh | ||
| run-with-preset.py | ||
| server-llm.sh | ||
| sync-ggml-am.sh | ||
| sync-ggml.last | ||
| sync-ggml.sh | ||
| verify-checksum-models.py | ||