cb150eb295
Use FP8 GPTQ-Marlin kernels to enable FP8 support on CUDA GPUs with compute capability >=8.0 and <8.9. Co-authored-by: Florian Zimmermeister <flozi00.fz@gmail.com> |
||
---|---|---|
.. | ||
marlin_kernels | ||
COPYRIGHT | ||
setup.py |