hf_text-generation-inference/backends/v3
Nicolas Patry 0204946d26
Max token capacity metric (#2595)
* adding max_token_capacity_metric

* added tgi to name of metric

* Adding max capacity metric.

* Add description for the metrics

---------

Co-authored-by: Edwinhr716 <Edandres249@gmail.com>
2024-10-02 16:32:36 +02:00
benches Keeping the benchmark somewhere (#2401) 2024-08-12 15:22:02 +02:00
src Max token capacity metric (#2595) 2024-10-02 16:32:36 +02:00
Cargo.toml fix: bump minijinja version and add test for llama 3.1 tools (#2463) 2024-08-27 13:31:08 -04:00
build.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00