-
vllm-gptq Wheels Stable
released this
2023-09-12 00:10:38 -06:00 | 168 commits to master since this releaseCommit: b633191
vllm_gptq-0.1.3-py3-none-any.whl
is NOT compiled with CUDA. It's just the VLLM part.E5-2620_a4000__vllm-0.1.3-cp311-cp311-linux_x86_64.whl
: compiled on an E5-2620 with an A4000.E5-2620v4_a4000__vllm-0.1.3-cp311-cp311-linux_x86_64.whl
compiled on an E5-2620v4 with an A4000.(make sure to remove the
E5-XXX_a4000__
part before installing these wheels)Downloads