hf_text-generation-inference

History

Nicolas Patry a04356fb8c Attempt for cleverer auto batch_prefill values (some simplifications). (#2808 ) * Attempt for cleverer auto batch_prefill values (some simplifications). * Less flaky tests. * Fixing typo insertion. * Update launcher/src/main.rs Co-authored-by: Daniël de Kok <me@danieldk.eu> * Adding small comment for source of calculation. * Adding L40. * Adding L40s. --------- Co-authored-by: Daniël de Kok <me@danieldk.eu>		2024-12-09 19:44:32 +01:00
..
images	Pali gemma modeling (#1895 )	2024-05-16 06:58:47 +02:00
models	Attempt for cleverer auto batch_prefill values (some simplifications). (#2808 )	2024-12-09 19:44:32 +01:00
conftest.py	Monkey patching as a desperate measure. (#2704 )	2024-10-28 11:25:13 +01:00
poetry.lock	Prefix test - Different kind of load test to trigger prefix test bugs. (#2490 )	2024-09-11 18:10:40 +02:00
pyproject.toml	nix: add black and isort to the closure (#2619 )	2024-10-09 11:08:02 +02:00
pytest.ini	chore: add pre-commit (#1569 )	2024-02-16 11:58:58 +01:00
requirements.txt	Prefix test - Different kind of load test to trigger prefix test bugs. (#2490 )	2024-09-11 18:10:40 +02:00