hf_text-generation-inference/server/text_generation_server/layers/awq/quantize
Wang, Yi b1f9044d6c
fix incorrect output of Qwen2-7B-Instruct-GPTQ-Int4 and Qwen2-7B-Inst… (#2717)
fix incorrect output of Qwen2-7B-Instruct-GPTQ-Int4 and Qwen2-7B-Instruct-AWQ
ipex kernel provide func like add_bias, so no need add it outside

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2024-11-04 16:07:51 +01:00
..
__init__.py CI job. Gpt awq 4 (#2665) 2024-10-18 17:55:53 +02:00
cuda.py CI job. Gpt awq 4 (#2665) 2024-10-18 17:55:53 +02:00
ipex.py fix incorrect output of Qwen2-7B-Instruct-GPTQ-Int4 and Qwen2-7B-Inst… (#2717) 2024-11-04 16:07:51 +01:00