Mirrors/hf_text-generation-inference
mirror of https://github.com/huggingface/text-generation-inference.git
fp8_rocm
hf_text-generation-inference/server/text_generation_server/layers/attention
Mohit Sharma  473d9a892d  Merge remote-tracking branch 'upstream/main' into rocm_6.2_updates  2024-09-27 15:36:12 +00:00
__init__.py           Improve support for GPUs with capability < 8 (#2575)                2024-09-27 16:19:42 +02:00
common.py             fix issue for sliding window models                                 2024-09-24 10:53:19 +00:00
cuda.py               Improve support for GPUs with capability < 8 (#2575)                2024-09-27 16:19:42 +02:00
flash_attn_triton.py  feat: add ruff and resolve issue (#2262)                            2024-07-26 10:29:09 -04:00
flashinfer.py         More tensor cores. (#2558)                                          2024-09-24 23:57:26 +02:00
ipex.py               Improve support for GPUs with capability < 8 (#2575)                2024-09-27 16:19:42 +02:00
rocm.py               Merge remote-tracking branch 'upstream/main' into rocm_6.2_updates  2024-09-27 15:36:12 +00:00