hf_text-generation-inference/launcher
Nicolas Patry a04356fb8c
Attempt for cleverer auto batch_prefill values (some simplifications). (#2808)
* Attempt for cleverer auto batch_prefill values (some simplifications).

* Less flaky tests.

* Fixing typo insertion.

* Update launcher/src/main.rs

Co-authored-by: Daniël de Kok <me@danieldk.eu>

* Adding small comment for source of calculation.

* Adding L40.

* Adding L40s.

---------

Co-authored-by: Daniël de Kok <me@danieldk.eu>
2024-12-09 19:44:32 +01:00
..
src Attempt for cleverer auto batch_prefill values (some simplifications). (#2808) 2024-12-09 19:44:32 +01:00
Cargo.toml CI (2592): Allow LoRA adapter revision in server launcher (#2602) 2024-10-02 10:51:04 -04:00
build.rs chore(github): add templates (#264) 2023-05-02 15:43:19 +02:00