A lightweight benchmarking tool inspired by oha and powered by tui.
Install
make install-benchmark
Run
First, start text-generation-inference:
text-generation-launcher --model-id bigscience/bloom-560m
Then run the benchmarking tool:
text-generation-benchmark --tokenizer-name bigscience/bloom-560m