hf_text-generation-inference/server/text_generation_server/models/custom_modeling
OlivierDehaene e14ae3b5e9
feat(server): support quantization for flash models (#200)
closes #197
2023-04-19 12:51:11 +02:00
..
__init__.py feat(server): flash santacoder (#153) 2023-04-03 19:06:42 +02:00
flash_llama_modeling.py feat(server): support quantization for flash models (#200) 2023-04-19 12:51:11 +02:00
flash_neox_modeling.py feat(server): support quantization for flash models (#200) 2023-04-19 12:51:11 +02:00
flash_santacoder_modeling.py feat(server): support quantization for flash models (#200) 2023-04-19 12:51:11 +02:00