OlivierDehaene | 895c5f1562 | feat(server): only compute prefill logprobs when asked (#406) Close #288 | 2023-06-02 17:12:30 +02:00 |
OlivierDehaene | b927244eb5 | feat(python-client): get list of currently deployed tgi models using the inference API (#191) | 2023-04-17 18:43:24 +02:00 |
OlivierDehaene | d8dc8f1b0c | feat(python-client): add new parameters (#118) | 2023-03-09 16:05:33 +01:00 |
OlivierDehaene | 2c5df5d2af | fix(python-client): stream not set on the sync client (#109) | 2023-03-08 16:48:16 +01:00 |
OlivierDehaene | 0ac38d336a | feat(launcher): allow parsing num_shard from CUDA_VISIBLE_DEVICES (#107) | 2023-03-08 11:06:59 +01:00 |
OlivierDehaene | 3fef90d50f | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00 |