hf_text-generation-inference/.github
drbh ad94f299f4 feat: compile vllm for cuda after flash_attn 2024-05-28 11:58:47 -04:00
ISSUE_TEMPLATE chore: add pre-commit (#1569) 2024-02-16 11:58:58 +01:00
workflows feat: compile vllm for cuda after flash_attn 2024-05-28 11:58:47 -04:00
PULL_REQUEST_TEMPLATE.md