Mirrors/hf_text-generation-inference
mirror of https://github.com/huggingface/text-generation-inference.git
hf_text-generation-inference/server/text_generation_server/utils/gptq @ 5a1cccbb98
OlivierDehaene  8bd0adb135  fix(server): fix quantization python requirements (#708)  2023-07-27 12:28:10 +02:00
custom_autotune.py  feat(server): Add inference support for GPTQ (llama + falcon tested) + Quantization script (#438)  2023-06-26 12:27:01 +02:00
exllama.py          fix(server): fix exllama buffers (#689)  2023-07-24 14:25:43 +02:00
quant_linear.py     feat(server): Add inference support for GPTQ (llama + falcon tested) + Quantization script (#438)  2023-06-26 12:27:01 +02:00
quantize.py         fix(server): fix quantization python requirements (#708)  2023-07-27 12:28:10 +02:00