hf_text-generation-inference/server/text_generation_server/models/custom_modeling
xiaobin 4cce84301b
fit for baichuan models (#981)
As more and more people begin to use Baichuan's open-source models, the
influence of Baichuan models is growing, especially in China. Many
community members are interested in adding support for Baichuan models
to TGI. Meanwhile, Baichuan is a very open company, and in the future,
it plans to open-source more and more models, taking all this into
consideration, we would like to add support for the Baichuan model to
TGI. To do this, we need to make some changes, which we hope can be
merged into the main branch of TGI. In the future, we would be happy to
help maintain support for Baichuan models in TGI. We sincerely hope that
our pull request can be accepted. Thank you.

By the way, the changes of this time mainly for supporting Baichuan-7B.

---------

Co-authored-by: xiaoyuze <xiaoyuze@baichuan.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2023-09-08 16:51:34 +02:00
..
__init__.py feat(server): flash santacoder (#153) 2023-04-03 19:06:42 +02:00
bloom_modeling.py feat: better errors for warmup and TP (#575) 2023-07-10 14:47:15 +02:00
flash_llama_modeling.py fit for baichuan models (#981) 2023-09-08 16:51:34 +02:00
flash_neox_modeling.py Adding Rope scaling. (#741) 2023-07-31 15:38:47 +02:00
flash_rw_modeling.py Fix f180 (#951) 2023-08-30 11:09:46 +02:00
flash_santacoder_modeling.py feat(server): Using `quantize_config.json` instead of GPTQ_BITS env variables. (#671) 2023-07-25 13:00:27 +02:00
idefics_config.py small fix on idefics (#954) 2023-09-01 18:44:34 +02:00
idefics_image_processing.py Adding Idefics multi modal model. (#842) 2023-08-17 14:38:49 +02:00
idefics_modeling.py Adding Idefics multi modal model. (#842) 2023-08-17 14:38:49 +02:00
idefics_perceiver.py Adding Idefics multi modal model. (#842) 2023-08-17 14:38:49 +02:00
idefics_processing.py Adding Idefics multi modal model. (#842) 2023-08-17 14:38:49 +02:00
idefics_vision.py Adding Idefics multi modal model. (#842) 2023-08-17 14:38:49 +02:00
mpt_modeling.py chore: fix typo in mpt_modeling.py (#737) 2023-07-31 15:43:44 +02:00
neox_modeling.py feat: better errors for warmup and TP (#575) 2023-07-10 14:47:15 +02:00
opt_modeling.py add FastLinear import (#750) 2023-08-02 20:04:46 +02:00
t5_modeling.py fix(server): Adding logger import to t5_modeling.py (#585) 2023-07-12 10:40:32 +02:00