Mirrors/hf_text-generation-inference
mirror of https://github.com/huggingface/text-generation-inference.git
hf_text-generation-inference/server/text_generation_server/utils/gptq @ bb2b2959a2
OlivierDehaene  9946165ee0  chore: add pre-commit (#1569)  2024-02-16 11:58:58 +01:00
custom_autotune.py  fix: fix quant linear autotune              2023-12-14 16:45:47 +01:00
exllama.py          fix: fix gpt-q with groupsize = -1 (#1358)  2023-12-18 16:07:05 +01:00
exllamav2.py        GPTQ support on ROCm (#1489)                2024-01-26 16:27:44 +01:00
quant_linear.py     chore: add pre-commit (#1569)               2024-02-16 11:58:58 +01:00
quantize.py         feat: format code (#1070)                   2023-09-27 12:22:09 +02:00