hf_text-generation-inference/router/src
OlivierDehaene e28a809004
v0.9.0 (#525)
2023-07-01 19:25:41 +02:00
..
health.rs feat(server): only compute prefill logprobs when asked (#406) 2023-06-02 17:12:30 +02:00
infer.rs feat(server): add paged attention to flash models (#516) 2023-06-30 19:09:59 +02:00
lib.rs chore: update openapi schema 2023-06-05 18:16:08 +02:00
main.rs feat(router): arg validation (#519) 2023-06-30 20:07:49 +02:00
queue.rs feat(server): add paged attention to flash models (#516) 2023-06-30 19:09:59 +02:00
server.rs v0.9.0 (#525) 2023-07-01 19:25:41 +02:00
validation.rs feat(server): only compute prefill logprobs when asked (#406) 2023-06-02 17:12:30 +02:00