hf_text-generation-inference/backends
Nicolas Patry 0204946d26
Max token capacity metric (#2595)
* adding max_token_capacity_metric

* added tgi to name of metric

* Adding max capacity metric.

* Add description for the metrics

---------

Co-authored-by: Edwinhr716 <Edandres249@gmail.com>
2024-10-02 16:32:36 +02:00
client Lots of improvements (Still 2 allocators) (#2449) 2024-08-29 16:29:01 +02:00
grpc-metadata Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
trtllm More fixes trtllm (#2342) 2024-08-14 12:02:05 +02:00
v2 Hotfixing main (#2556) 2024-09-24 11:51:14 +02:00
v3 Max token capacity metric (#2595) 2024-10-02 16:32:36 +02:00