hf_text-generation-inference/router/src
drbh 5489406c4a
PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme (#2645)
* add OpenAI like tool_choice for named choice

* add tests

* fix: run linter and bump api docs

* fix: consolidate changes and remove old tool type

* feat: improve, simplify and rename tool choice struct add required support and refactor

* fix: simplify tool choice logic, improve tests, openapi and rust docs

* fix: refactor away prepare_chat_input and improve tool grammar apply control flow

* feat: update docs and add tool choice configuration section

* fix: simplify naming, tool choice default and improve test

* fix: adjust tool choice none logic, add test and small refactors

* fix: add missing snapshot file

* fix: adjust tool choice type in test

* fix: adjust default when json tool choice is

* fix: remove trailing space lint after rebase

* fix: remove mostly mocked unit test

---------

Co-authored-by: Linus Bierhoff <linus.bierhoff@icloud.com>
2024-11-19 13:31:59 -05:00
..
infer PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme (#2645) 2024-11-19 13:31:59 -05:00
config.rs Support qwen2 vl (#2689) 2024-10-30 12:40:51 -04:00
kserve.rs fix: simplify kserve endpoint and fix imports (#2119) 2024-06-25 19:30:10 -04:00
lib.rs PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme (#2645) 2024-11-19 13:31:59 -05:00
logging.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
sagemaker.rs feat: allow any supported payload on /invocations (#2683) 2024-10-23 11:26:01 +00:00
server.rs PR 2634 CI - Fix the tool_choice format for named choice by adapting OpenAIs scheme (#2645) 2024-11-19 13:31:59 -05:00
usage_stats.rs feat: allow any supported payload on /invocations (#2683) 2024-10-23 11:26:01 +00:00
validation.rs add trust_remote_code in tokenizer to fix baichuan issue (#2725) 2024-11-07 14:43:38 +01:00
vertex.rs Rollback to `ChatRequest` for Vertex AI Chat instead of `VertexChat` (#2651) 2024-10-15 18:11:59 +02:00