OlivierDehaene
|
62f91f78ac
|
feat(server): support vectorized warpers in flash causal lm (#317)
Co-authored-by: Joel Lamy-Poirier <joel.lamy-poirier@servicenow.com>
|
2023-05-26 12:30:27 +02:00 |
OlivierDehaene
|
942005386a
|
feat(router): log input/ouput at debug level (#364)
@njhill FYI
|
2023-05-23 20:47:37 +02:00 |
OlivierDehaene
|
cfaa858070
|
feat(server): support fp16 for t5 (#360)
Fixes #349
|
2023-05-23 18:16:48 +02:00 |
OlivierDehaene
|
91d9beec90
|
fix(server): fix init for flash causal lm (#352)
Fixes #347
|
2023-05-22 15:05:32 +02:00 |
OlivierDehaene
|
5a58226130
|
fix(server): fix decode token (#334)
Fixes #333
---------
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
|
2023-05-16 23:23:27 +02:00 |
OlivierDehaene
|
dbdc587ddd
|
feat(integration-tests): improve comparison and health checks (#336)
|
2023-05-16 20:22:11 +02:00 |
OlivierDehaene
|
e71471bec9
|
feat: add snapshot testing (#282)
|
2023-05-15 23:36:30 +02:00 |