hf_text-generation-inference/router/src
drbh f433f1f770
implement Open Inference Protocol endpoints (#1942)
* feat: add kserve feature and basic routes

* feat: implement infer endpoint wrapper around generate

* fix: refactor and improve types

* fix: improve infer and simplify

* fix: cleanup and improve api docs

* fix: refactor and encapsulate kserve feat in file

* fix: remove typos after rebase
2024-06-13 12:51:51 -04:00
..
infer PR #2049 CI run (#2054) 2024-06-13 11:53:49 -04:00
config.rs router: send the input as chunks to the backend 2024-06-03 17:02:41 +02:00
kserve.rs implement Open Inference Protocol endpoints (#1942) 2024-06-13 12:51:51 -04:00
lib.rs implement Open Inference Protocol endpoints (#1942) 2024-06-13 12:51:51 -04:00
main.rs feat: add SchedulerV3 (#1996) 2024-06-04 15:56:56 +02:00
server.rs implement Open Inference Protocol endpoints (#1942) 2024-06-13 12:51:51 -04:00
validation.rs feat: add SchedulerV3 (#1996) 2024-06-04 15:56:56 +02:00