cb150eb295: Use FP8 GPTQ-Marlin kernels to enable FP8 support on CUDA GPUs with compute capability >=8.0 and <8.9. Co-authored-by: Florian Zimmermeister <flozi00.fz@gmail.com> | | |
---|---|---|
sparse | ||
__init__.pyi | ||
ext.cpp | ||
ext.hh | ||
fp8_marlin.cu | ||
gptq_marlin.cu | ||
gptq_marlin.cuh | ||
gptq_marlin_dtypes.cuh | ||
gptq_marlin_repack.cu | ||
marlin_cuda_kernel.cu | ||
py.typed |
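
The compute-capability range named in the commit message can be read as a half-open interval check. The following is a minimal sketch of that check only; the function name `needs_fp8_marlin` and its signature are hypothetical and are not taken from this repository.

```python
def needs_fp8_marlin(major: int, minor: int) -> bool:
    """Return True when a GPU falls in the range the commit message names.

    Hypothetical helper: the commit says the FP8 GPTQ-Marlin kernels cover
    CUDA compute capability >= 8.0 and < 8.9. What happens outside that
    range is not stated in the source, so this only answers the range check.
    """
    capability = (major, minor)
    # Tuples compare lexicographically, so (8, 6) sits between (8, 0) and (8, 9).
    return (8, 0) <= capability < (8, 9)


if __name__ == "__main__":
    # Example capabilities: 7.5, 8.0, 8.6, 8.9, 9.0
    for cap in [(7, 5), (8, 0), (8, 6), (8, 9), (9, 0)]:
        print(cap, needs_fp8_marlin(*cap))
```

With PyTorch installed, the pair returned by `torch.cuda.get_device_capability()` could be passed in as `needs_fp8_marlin(*torch.cuda.get_device_capability())`.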