hf_text-generation-inference/server/text_generation_server/models
Mohit Sharma 7cb49f6f4f float16 dep 2024-09-27 15:53:44 +00:00
..
custom_modeling float16 dep 2024-09-27 15:53:44 +00:00
__init__.py Add support for scalar FP8 weight scales (#2550) 2024-09-24 13:57:40 +02:00
bloom.py
causal_lm.py Fixing exl2 and other quanize tests again. (#2419) 2024-08-15 11:12:51 +02:00
flash_causal_lm.py euff 2024-09-18 12:03:52 +00:00
galactica.py
globals.py addressed review comments 2024-09-27 10:28:37 +00:00
idefics.py Upgrading exl2. (#2415) 2024-08-14 11:58:08 +02:00
idefics_causal_lm.py Upgrading exl2. (#2415) 2024-08-14 11:58:08 +02:00
mamba.py Fixing exl2 and other quanize tests again. (#2419) 2024-08-15 11:12:51 +02:00
model.py
pali_gemma.py
seq2seq_lm.py Fixing exl2 and other quanize tests again. (#2419) 2024-08-15 11:12:51 +02:00
types.py
vlm_causal_lm.py Prefix test - Different kind of load test to trigger prefix test bugs. (#2490) 2024-09-11 18:10:40 +02:00