hf_text-generation-inference/router/client
OlivierDehaene fe80f5360c
feat(server): auto max_batch_total_tokens for flash att models (#630)
2023-07-19 09:31:25 +02:00
src feat(server): auto max_batch_total_tokens for flash att models (#630) 2023-07-19 09:31:25 +02:00
Cargo.toml v0.9.0 (#525) 2023-07-01 19:25:41 +02:00
build.rs feat: add distributed tracing (#62) 2023-02-13 13:02:45 +01:00