hf_text-generation-inference/backends/v3/src
Nicolas Patry 0204946d26
Max token capacity metric (#2595)
* adding max_token_capacity_metric

* added tgi to name of metric

* Adding max capacity metric.

* Add description for the metrics

---------

Co-authored-by: Edwinhr716 <Edandres249@gmail.com>
2024-10-02 16:32:36 +02:00
..
client Lots of improvements (Still 2 allocators) (#2449) 2024-08-29 16:29:01 +02:00
backend.rs Prefix test - Different kind of load test to trigger prefix test bugs. (#2490) 2024-09-11 18:10:40 +02:00
block_allocator.rs Lots of improvements (Still 2 allocators) (#2449) 2024-08-29 16:29:01 +02:00
lib.rs Max token capacity metric (#2595) 2024-10-02 16:32:36 +02:00
main.rs Pr 2352 ci branch (#2382) 2024-08-09 10:54:32 +02:00
queue.rs Adding a test for FD. (#2516) 2024-09-16 17:00:54 +02:00
radix.rs Adding a test for FD. (#2516) 2024-09-16 17:00:54 +02:00