hf_text-generation-inference/server/text_generation_server/models/custom_modeling
OlivierDehaene 299217c95c
feat(server): add flash attention llama (#144)
2023-04-11 16:38:22 +02:00
..
__init__.py feat(server): flash santacoder (#153) 2023-04-03 19:06:42 +02:00
flash_llama_modeling.py feat(server): add flash attention llama (#144) 2023-04-11 16:38:22 +02:00
flash_neox_modeling.py feat(router): make router input validation optional (#164) 2023-04-09 20:22:27 +02:00
flash_santacoder_modeling.py feat(router): make router input validation optional (#164) 2023-04-09 20:22:27 +02:00