This website requires JavaScript.
Explore
Help
Sign In
pingu_98
/
llama_cpp_for_radxa_dragon_wing_q6a
Watch
1
Star
0
Fork
You've already forked llama_cpp_for_radxa_dragon_wing_q6a
0
Code
Issues
Pull requests
Projects
Releases
Packages
Wiki
Activity
Actions
91159ee9df
llama_cpp_for_radxa_dragon_...
/
tests
History
David Huang
7f323a589f
Add
--no-op-offload
to improve
-ot
pp perf in MoE models like llama4 400B (
#13386
)
2025-05-11 14:18:39 +02:00
..
.gitignore
CMakeLists.txt
get-model.cpp
get-model.h
run-json-schema-to-grammar.mjs
test-arg-parser.cpp
test-autorelease.cpp
test-backend-ops.cpp
test-barrier.cpp
test-c.c
test-chat-template.cpp
test-chat.cpp
test-double-float.cpp
test-gbnf-validator.cpp
test-gguf.cpp
test-grammar-integration.cpp
test-grammar-llguidance.cpp
test-grammar-parser.cpp
test-json-schema-to-grammar.cpp
test-llama-grammar.cpp
test-log.cpp
test-lora-conversion-inference.sh
test-model-load-cancel.cpp
test-mtmd-c-api.c
test-opt.cpp
Add
--no-op-offload
to improve
-ot
pp perf in MoE models like llama4 400B (
#13386
)
2025-05-11 14:18:39 +02:00
test-quantize-fns.cpp
test-quantize-perf.cpp
test-quantize-stats.cpp
test-rope.cpp
test-sampling.cpp
sampling : make top_n_sigma no-op at <=0 or a single candidate (
#13345
)
2025-05-06 22:36:24 +02:00
test-tokenizer-0.cpp
test-tokenizer-0.py
test-tokenizer-0.sh
test-tokenizer-1-bpe.cpp
test-tokenizer-1-spm.cpp
test-tokenizer-random.py