preemo_text-generation-infe.../router/src
Michael Feil 339ede9e90
Update Readme.md / documentation (#15)
* add documentation updates

* update readme

* Update README.md
2023-10-03 23:01:06 -07:00
..
completion.rs Wrapping completions and chat/completions endpoint (#2) 2023-09-27 08:58:07 -07:00
health.rs feat(server): only compute prefill logprobs when asked (#406) 2023-06-02 17:12:30 +02:00
infer.rs feat(server): auto max_batch_total_tokens for flash att models (#630) 2023-07-19 09:31:25 +02:00
lib.rs Wrapping completions and chat/completions endpoint (#2) 2023-09-27 08:58:07 -07:00
main.rs feat(server): update vllm version (#723) 2023-07-28 15:36:38 +02:00
queue.rs feat(server): auto max_batch_total_tokens for flash att models (#630) 2023-07-19 09:31:25 +02:00
server.rs Update Readme.md / documentation (#15) 2023-10-03 23:01:06 -07:00
validation.rs Compilation fix: Correct method argument types in generation.rs and validation.rs (#10) 2023-08-23 13:52:49 -07:00