OlivierDehaene | 895c5f1562 | feat(server): only compute prefill logprobs when asked (#406) Close #288 | 2023-06-02 17:12:30 +02:00
OlivierDehaene | dbdc587ddd | feat(integration-tests): improve comparison and health checks (#336) | 2023-05-16 20:22:11 +02:00
Ehsan M. Kermani | f092ba9b22 | feat(server): add watermarking tests (#248) | 2023-04-27 19:16:35 +02:00
OlivierDehaene | 323546df1d | fix(python-client): add auth headers to is supported requests (#234) | 2023-04-25 13:55:26 +02:00
OlivierDehaene | b927244eb5 | feat(python-client): get list of currently deployed tgi models using the inference API (#191) | 2023-04-17 18:43:24 +02:00
OlivierDehaene | d6a93fe992 | fix(server): fix flash-neox scores warping (#137) | 2023-03-24 18:21:41 +01:00
OlivierDehaene | 5d04525cb9 | feat(python-client): release v0.4.0 (#135) | 2023-03-23 18:07:20 +01:00
OlivierDehaene | a3b7db932f | fix(python-client): relax dependencies (#129) | 2023-03-16 12:57:07 +01:00
OlivierDehaene | d8dc8f1b0c | feat(python-client): add new parameters (#118) | 2023-03-09 16:05:33 +01:00
OlivierDehaene | 2c5df5d2af | fix(python-client): stream not set on the sync client (#109) | 2023-03-08 16:48:16 +01:00
OlivierDehaene | 0ac38d336a | feat(launcher): allow parsing num_shard from CUDA_VISIBLE_DEVICES (#107) | 2023-03-08 11:06:59 +01:00
OlivierDehaene | 3fef90d50f | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00