hf_text-generation-inference/server/tests/models
OlivierDehaene ebc74d5666
feat(router): use number of tokens in batch as input for dynamic batching (#226)
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
2023-04-24 17:59:00 +02:00
..
test_bloom.py feat(router): use number of tokens in batch as input for dynamic batching (#226) 2023-04-24 17:59:00 +02:00
test_causal_lm.py feat(router): use number of tokens in batch as input for dynamic batching (#226) 2023-04-24 17:59:00 +02:00
test_santacoder.py feat(router): make router input validation optional (#164) 2023-04-09 20:22:27 +02:00
test_seq2seq_lm.py feat(server): reduce memory requirement (#214) 2023-04-24 14:15:42 +02:00