llama_cpp_for_radxa_dragon_.../tools
Fredrik Hultin ddf9f94389
server : add Anthropic Messages API support (#17570)
* server : add Anthropic Messages API support

* remove -@pytest.mark.slow from tool calling/jinja tests

* server : remove unused code and slow/skip on test_anthropic_vision_base64_with_multimodal_model in test_anthropic_api.py

* server : removed redundant n field logic in anthropic_params_from_json

* server : use single error object instead of error_array in streaming response handler for /v1/chat/completions and use unordered_set instead of set in to_json_anthropic_stream()

* server : refactor Anthropic API to use OAI conversion

* make sure basic test always go first

* clean up

* clean up api key check, add test

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
2025-11-28 12:57:04 +01:00
..
batched-bench
cvector-generator
export-lora
gguf-split
imatrix
llama-bench
main common : more accurate sampling timing (#17382) 2025-11-20 13:40:10 +02:00
mtmd clip: (minicpmv) fix resampler kq_scale (#17516) 2025-11-26 21:44:07 +01:00
perplexity
quantize
rpc
run
server server : add Anthropic Messages API support (#17570) 2025-11-28 12:57:04 +01:00
tokenize
tts
CMakeLists.txt