OlivierDehaene | ece7ffa40a | feat(server): improve flash attention import errors (#465) @lewtun, is this enough? Closes #458 Closes #456 | 2023-06-19 09:53:45 +02:00 |
OlivierDehaene | 6abec14a7e | feat(server): batch tokenization for flash causal lm (#411) | 2023-06-05 16:09:41 +02:00 |
OlivierDehaene | 87dc034b59 | feat(server): add retry on download (#384) | 2023-05-31 10:57:53 +02:00 |
OlivierDehaene | 85aa7e2e7b | feat(server): support hf endpoint weight layout (#266) | 2023-05-03 11:36:24 +02:00 |
OlivierDehaene | f26dfd0dc1 | feat(server): support OPT models (#55) OPT models do not all have a `tokenizer.json` file on the hub at the moment. Can't merge for now. | 2023-04-11 19:16:41 +02:00 |
OlivierDehaene | 3fef90d50f | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00 |