hf_text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git

History

Nicolas Patry a1aac7843b Choosing input/total tokens automatically based on available VRAM?		2024-10-21 15:06:36 +02:00
..
src	Choosing input/total tokens automatically based on available VRAM?	2024-10-21 15:06:36 +02:00
Cargo.toml	CI (2592): Allow LoRA adapter revision in server launcher (#2602 )	2024-10-02 10:51:04 -04:00
build.rs	chore(github): add templates (#264 )	2023-05-02 15:43:19 +02:00