hf_text-generation-inference/server/text_generation/models
OlivierDehaene c6e8b9442b
fix(server): fix quantization for sharded models (#45)
2023-01-31 17:40:38 +01:00
__init__.py feat(server): Support SantaCoder (#26) 2023-01-20 12:24:39 +01:00
bloom.py fix(server): fix quantization for sharded models (#45) 2023-01-31 17:40:38 +01:00
causal_lm.py feat: Add token streaming using ServerSideEvents support (#41) 2023-01-31 17:04:00 +01:00
galactica.py fix(server): fix quantization for sharded models (#45) 2023-01-31 17:40:38 +01:00
model.py fix(server): Minor refactorization using new_zeros (#24) 2023-01-17 09:10:22 +01:00
santacoder.py feat: Support sampling seeding (#37) 2023-01-30 15:36:16 +01:00
seq2seq_lm.py feat: Add token streaming using ServerSideEvents support (#41) 2023-01-31 17:04:00 +01:00
types.py feat: Add token streaming using ServerSideEvents support (#41) 2023-01-31 17:04:00 +01:00