• vllm-gptq-b633191 747d838138

    cyberes released this 2023-09-12 00:10:38 -06:00 | 168 commits to master since this release

    Commit: b633191

    vllm_gptq-0.1.3-py3-none-any.whl is NOT compiled with CUDA. It's just the VLLM part.

    E5-2620_a4000__vllm-0.1.3-cp311-cp311-linux_x86_64.whl: compiled on an E5-2620 with an A4000.

    E5-2620v4_a4000__vllm-0.1.3-cp311-cp311-linux_x86_64.whl compiled on an E5-2620v4 with an A4000.

    (make sure to remove the E5-XXX_a4000__ part before installing these wheels)

    Downloads