hf_text-generation-inference/router/src
drbh e36dfaa8de
feat: allow tool calling to respond without a tool (#2614)
* feat: process token stream before returning to client

* fix: expect content in test

* fix: improve comparison via ruff lint

* fix: return event in all cases

* fix: always send event on error, avoid unwraps, refactor and improve tests

* fix: prefer no_tool over notify_error to improve response

* fix: adjust chat input test for no_tool

* fix: adjust test expected content

---------

Co-authored-by: System administrator <root@ip-10-90-0-186.ec2.internal>
2024-10-10 09:28:25 -04:00
infer feat: allow tool calling to respond without a tool (#2614) 2024-10-10 09:28:25 -04:00
config.rs Mllama flash version (#2585) 2024-10-02 11:22:13 +02:00
kserve.rs fix: simplify kserve endpoint and fix imports (#2119) 2024-06-25 19:30:10 -04:00
lib.rs CI (2599): Update ToolType input schema (#2601) 2024-10-08 12:35:48 -04:00
logging.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
main.rs.back Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
server.rs feat: allow tool calling to respond without a tool (#2614) 2024-10-10 09:28:25 -04:00
usage_stats.rs refactor usage stats (#2339) 2024-07-31 16:29:07 +02:00
validation.rs Mllama flash version (#2585) 2024-10-02 11:22:13 +02:00
vertex.rs Cleanup Vertex + Chat (#2553) 2024-09-24 23:37:17 +02:00