| Author | Commit | Message | Date |
|---|---|---|---|
| OlivierDehaene | 895c5f1562 | feat(server): only compute prefill logprobs when asked (#406); Close #288 | 2023-06-02 17:12:30 +02:00 |
| OlivierDehaene | 87dc034b59 | feat(server): add retry on download (#384) | 2023-05-31 10:57:53 +02:00 |
| OlivierDehaene | 444400b457 | increase health checks | 2023-05-31 10:55:59 +02:00 |
| OlivierDehaene | b8b950b37c | feat(server): support RefinedWeb models (#379) | 2023-05-30 18:25:19 +02:00 |
| OlivierDehaene | 62f91f78ac | feat(server): support vectorized warpers in flash causal lm (#317); Co-authored-by: Joel Lamy-Poirier <joel.lamy-poirier@servicenow.com> | 2023-05-26 12:30:27 +02:00 |
| OlivierDehaene | cfaa858070 | feat(server): support fp16 for t5 (#360); Fixes #349 | 2023-05-23 18:16:48 +02:00 |
| OlivierDehaene | 91d9beec90 | fix(server): fix init for flash causal lm (#352); Fixes #347 | 2023-05-22 15:05:32 +02:00 |
| OlivierDehaene | 5a58226130 | fix(server): fix decode token (#334); Fixes #333; Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> | 2023-05-16 23:23:27 +02:00 |
| OlivierDehaene | dbdc587ddd | feat(integration-tests): improve comparison and health checks (#336) | 2023-05-16 20:22:11 +02:00 |
| OlivierDehaene | e71471bec9 | feat: add snapshot testing (#282) | 2023-05-15 23:36:30 +02:00 |