OlivierDehaene
|
f59fb8b630
|
feat(router): add ngrok integration (#453)
|
2023-06-16 16:25:11 +02:00 |
OlivierDehaene
|
e7248fe90e
|
v0.8.2
|
2023-06-01 19:49:13 +02:00 |
OlivierDehaene
|
db2ebe3947
|
v0.8.1
|
2023-05-31 12:08:40 +02:00 |
OlivierDehaene
|
081b926584
|
v0.8.0
|
2023-05-30 18:39:35 +02:00 |
OlivierDehaene
|
951930fbff
|
feat(benchmarker): add summary tables (#368)
|
2023-05-25 13:38:36 +02:00 |
OlivierDehaene
|
d31562f300
|
v0.7.0 (#353)
|
2023-05-23 21:20:49 +02:00 |
OlivierDehaene
|
e250282213
|
feat(docker): add benchmarking tool to docker image (#298)
|
2023-05-09 13:19:31 +02:00 |
Nicolas Patry
|
411b0d4e1f
|
chore(github): add templates (#264)
|
2023-05-02 15:43:19 +02:00 |
OlivierDehaene
|
6ded76a4ae
|
v0.6.0 (#222)
|
2023-04-21 21:00:57 +02:00 |
OlivierDehaene
|
709d8936f6
|
feat(router): drop requests when client closes the channel (#202)
|
2023-04-20 11:07:40 +02:00 |
OlivierDehaene
|
2475aede61
|
feat(router): add info route (#196)
close #125
|
2023-04-18 16:16:06 +02:00 |
OlivierDehaene
|
64347b05ff
|
fix(ci): fix CVE in github-slug-action (#174)
|
2023-04-13 12:43:05 +02:00 |
OlivierDehaene
|
6f0f1d70f6
|
v0.5.0 (#168)
|
2023-04-11 20:32:18 +02:00 |
OlivierDehaene
|
9987960062
|
feat(router): make router input validation optional (#164)
|
2023-04-09 20:22:27 +02:00 |
OlivierDehaene
|
fef1a1c381
|
v0.4.3 (#152)
|
2023-03-30 17:28:14 +02:00 |
OlivierDehaene
|
84722f3e33
|
v0.4.2 (#151)
|
2023-03-30 17:10:01 +02:00 |
OlivierDehaene
|
ab5fd8cf93
|
v0.4.1 (#140)
|
2023-03-26 16:37:51 +02:00 |
OlivierDehaene
|
411d6247f4
|
v0.4.0 (#119)
|
2023-03-09 16:07:01 +01:00 |
OlivierDehaene
|
1c19b0934e
|
v0.3.2 (#97)
|
2023-03-03 18:42:20 +01:00 |
OlivierDehaene
|
2d39f199ae
|
feat(server): update to hf_transfer==0.1.2 (#93)
|
2023-03-03 11:26:27 +01:00 |
OlivierDehaene
|
4e685d907e
|
feat(router): ask hf.co for pipelinetag to decide on compat_return_full_text (#89)
|
2023-02-28 10:19:32 +01:00 |
OlivierDehaene
|
4b1c9720c0
|
v0.3.1 (#84)
|
2023-02-24 13:27:41 +01:00 |
OlivierDehaene
|
6796d38c6d
|
feat(router): add cors allow origin options (#73)
|
2023-02-17 18:22:00 +01:00 |
OlivierDehaene
|
c720555adc
|
v0.3.0 (#72)
|
2023-02-16 17:28:29 +01:00 |
OlivierDehaene
|
439fcaf810
|
feat(router): add prometheus metrics scrape endpoint (#71)
|
2023-02-16 17:18:53 +01:00 |
OlivierDehaene
|
9af454142a
|
feat: add distributed tracing (#62)
|
2023-02-13 13:02:45 +01:00 |
OlivierDehaene
|
2fe5e1b30e
|
V0.2.1 (#58)
|
2023-02-07 15:40:25 +01:00 |
OlivierDehaene
|
20c3c5940c
|
feat(router): refactor API and add openAPI schemas (#53)
|
2023-02-03 12:43:37 +01:00 |
OlivierDehaene
|
017a2a8c2f
|
feat: Add token streaming using ServerSideEvents support (#41)
|
2023-01-31 17:04:00 +01:00 |
OlivierDehaene
|
54fec93193
|
fix(server): fix seeding with multiple shards (#44)
|
2023-01-31 16:01:15 +01:00 |
OlivierDehaene
|
4f9ac67cfa
|
Revert "feat: Add token streaming using ServerSideEvents support" (#40)
Reverts huggingface/text-generation-inference#36
|
2023-01-31 14:21:51 +01:00 |
OlivierDehaene
|
7fbfbb0dc5
|
feat: Add token streaming using ServerSideEvents support (#36)
Add token streaming using ServerSideEvents (SSE).
The signature of the SSE events is:
```rust
struct Details {
finish_reason: String,
generated_tokens: u32,
seed: Option<u64>,
}
struct StreamResponse {
token: Token,
generated_text: Option<String>,
details: Option<Details>,
}
struct ErrorResponse {
error: String,
}
```
|
2023-01-31 11:49:43 +01:00 |
OlivierDehaene
|
1539d3cbbe
|
feat(router): Remove second lock from batcher hot path (#27)
@njhill
|
2023-01-26 16:29:13 +01:00 |
OlivierDehaene
|
3e2e6240b8
|
feat(launcher): Add integration tests (#9)
|
2022-12-16 11:29:36 +01:00 |
OlivierDehaene
|
dccd5c2b1a
|
feat(server): Clarify CausalLMBatch concatenate method
|
2022-11-09 18:24:07 +01:00 |
OlivierDehaene
|
b3b7ea0d74
|
feat: Use json formatter by default in docker image
|
2022-11-02 17:29:56 +01:00 |
OlivierDehaene
|
3cf6368c77
|
feat(server): Support all AutoModelForCausalLM on a best effort basis
|
2022-10-28 19:24:00 +02:00 |
Olivier Dehaene
|
f16f2f5ae1
|
v0.1.0
|
2022-10-20 19:14:44 +02:00 |