OlivierDehaene
08e9181418
feat: update client to 0.7 ( #1667 )
...
Close #1652
2024-03-22 17:10:56 +01:00
drbh
de6cb15fa5
fix: improve tool type, bump pydantic and outlines ( #1650 )
...
This PR resolves a couple
- [X] adjusts the tool response to align with openai's tools response
type
- [X] bumps pydantic to `2.6.4` in all apps (resolves dependency issue
when running tests)
- [X] bump `outlines` version and fix import for new name
2024-03-21 12:45:56 -04:00
OlivierDehaene
3b56d7669b
feat: add mistral model ( #1071 )
2023-09-28 09:55:47 +02:00
Jelle Zijlstra
c8bbbd8129
chore(client): Support Pydantic 2 ( #900 )
...
This should allow users to use either Pydantic 2 or Pydantic 1.
I couldn't run all tests locally because I reran them too often and got
rate limited, but I believe this is sufficient.
2023-09-06 14:12:08 +02:00
OlivierDehaene
895c5f1562
feat(server): only compute prefill logprobs when asked ( #406 )
...
Close #288
2023-06-02 17:12:30 +02:00
OlivierDehaene
dbdc587ddd
feat(integration-tests): improve comparison and health checks ( #336 )
2023-05-16 20:22:11 +02:00
Ehsan M. Kermani
f092ba9b22
feat(server): add watermarking tests ( #248 )
2023-04-27 19:16:35 +02:00
OlivierDehaene
323546df1d
fix(python-client): add auth headers to is supported requests ( #234 )
2023-04-25 13:55:26 +02:00
OlivierDehaene
b927244eb5
feat(python-client): get list of currently deployed tgi models using the inference API ( #191 )
2023-04-17 18:43:24 +02:00
OlivierDehaene
d6a93fe992
fix(server): fix flash-neox scores warping ( #137 )
2023-03-24 18:21:41 +01:00
OlivierDehaene
5d04525cb9
feat(python-client): release v0.4.0 ( #135 )
2023-03-23 18:07:20 +01:00
OlivierDehaene
a3b7db932f
fix(python-client): relax dependencies ( #129 )
2023-03-16 12:57:07 +01:00
OlivierDehaene
d8dc8f1b0c
feat(python-client): add new parameters ( #118 )
2023-03-09 16:05:33 +01:00
OlivierDehaene
2c5df5d2af
fix(python-client): stream not set on the sync client ( #109 )
2023-03-08 16:48:16 +01:00
OlivierDehaene
0ac38d336a
feat(launcher): allow parsing num_shard from CUDA_VISIBLE_DEVICES ( #107 )
2023-03-08 11:06:59 +01:00
OlivierDehaene
3fef90d50f
feat(clients): Python client ( #103 )
2023-03-07 18:52:22 +01:00