This website requires JavaScript.
Explore
Gist
Help
Register
Sign In
Mirrors
/
hf_text-generation-inference
mirror of
https://github.com/huggingface/text-generation-inference.git
Watch
1
Star
0
Fork
You've already forked hf_text-generation-inference
0
Code
Issues
Packages
Projects
Releases
Wiki
Activity
180
Commits
181
Branches
49
Tags
605
MiB
343437c7b5
Commit Graph
3 Commits
Author
SHA1
Message
Date
OlivierDehaene
343437c7b5
feat(router): add device and dtype info (
#215
)
2023-04-21 15:36:29 +02:00
OlivierDehaene
e14ae3b5e9
feat(server): support quantization for flash models (
#200
)
...
closes
#197
2023-04-19 12:51:11 +02:00
OlivierDehaene
299217c95c
feat(server): add flash attention llama (
#144
)
2023-04-11 16:38:22 +02:00