hf_text-generation-inference/server/exllama_kernels/exllama_kernels
OlivierDehaene 9946165ee0
chore: add pre-commit (#1569)
2024-02-16 11:58:58 +01:00
..
cuda_func chore: add pre-commit (#1569) 2024-02-16 11:58:58 +01:00
cu_compat.cuh GPTQ support on ROCm (#1489) 2024-01-26 16:27:44 +01:00
cuda_buffers.cu feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
cuda_buffers.cuh feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
exllama_ext.cpp feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
hip_compat.cuh chore: add pre-commit (#1569) 2024-02-16 11:58:58 +01:00
matrix.cuh feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
tuning.h feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
util.cuh GPTQ support on ROCm (#1489) 2024-01-26 16:27:44 +01:00