hf_text-generation-inference

History

OlivierDehaene c6e8b9442b fix(server): fix quantization for sharded models (#45 )		2023-01-31 17:40:38 +01:00
..
__init__.py	feat(server): Support SantaCoder (#26 )	2023-01-20 12:24:39 +01:00
bloom.py	fix(server): fix quantization for sharded models (#45 )	2023-01-31 17:40:38 +01:00
causal_lm.py	feat: Add token streaming using ServerSideEvents support (#41 )	2023-01-31 17:04:00 +01:00
galactica.py	fix(server): fix quantization for sharded models (#45 )	2023-01-31 17:40:38 +01:00
model.py	fix(server): Minor refactorization using new_zeros (#24 )	2023-01-17 09:10:22 +01:00
santacoder.py	feat: Support sampling seeding (#37 )	2023-01-30 15:36:16 +01:00
seq2seq_lm.py	feat: Add token streaming using ServerSideEvents support (#41 )	2023-01-31 17:04:00 +01:00
types.py	feat: Add token streaming using ServerSideEvents support (#41 )	2023-01-31 17:04:00 +01:00