hf_text-generation-inference

History

OlivierDehaene 8bd0adb135 fix(server): fix quantization python requirements (#708 )		2023-07-27 12:28:10 +02:00
..
custom_autotune.py	feat(server): Add inference support for GPTQ (llama + falcon tested) + Quantization script (#438 )	2023-06-26 12:27:01 +02:00
exllama.py	fix(server): fix exllama buffers (#689 )	2023-07-24 14:25:43 +02:00
quant_linear.py	feat(server): Add inference support for GPTQ (llama + falcon tested) + Quantization script (#438 )	2023-06-26 12:27:01 +02:00
quantize.py	fix(server): fix quantization python requirements (#708 )	2023-07-27 12:28:10 +02:00