hf_text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git

OlivierDehaene e14ae3b5e9 feat(server): support quantization for flash models (#200) closes #197		2023-04-19 12:51:11 +02:00
__init__.py	feat(server): flash santacoder (#153)	2023-04-03 19:06:42 +02:00
flash_llama_modeling.py	feat(server): support quantization for flash models (#200)	2023-04-19 12:51:11 +02:00
flash_neox_modeling.py	feat(server): support quantization for flash models (#200)	2023-04-19 12:51:11 +02:00
flash_santacoder_modeling.py	feat(server): support quantization for flash models (#200)	2023-04-19 12:51:11 +02:00