Cyberes
|
fd09c783d3
|
refactor a lot of things, major cleanup, use postgresql
|
2024-05-07 17:03:41 -06:00 |
Cyberes
|
ee9a0d4858
|
redo config
|
2024-05-07 12:20:53 -06:00 |
Cyberes
|
ff82add09e
|
redo database connection, add pooling, minor logging changes, other clean up
|
2024-05-07 09:48:51 -06:00 |
Cyberes
|
ab408c6c5b
|
ready for public release
|
2024-03-18 12:42:44 -06:00 |
Cyberes
|
4b3e0671c6
|
clean some stuff up, bump VLLM version
|
2024-01-10 15:01:26 -07:00 |
Cyberes
|
0059e7956c
|
Merge cluster to master (#3)
Co-authored-by: Cyberes <cyberes@evulid.cc>
Reviewed-on: #3
|
2023-10-27 19:19:22 -06:00 |
Cyberes
|
e86a5182eb
|
redo background processes, reorganize server.py
|
2023-09-27 23:36:44 -06:00 |
Cyberes
|
d9bbcc42e6
|
more work on openai endpoint
|
2023-09-26 22:09:11 -06:00 |
Cyberes
|
d2651756df
|
update requirements.txt
|
2023-09-24 21:46:48 -06:00 |
Cyberes
|
84ea2f8891
|
handle when auth token is not enabled
|
2023-09-24 15:57:39 -06:00 |
Cyberes
|
8d6b2ce49c
|
minor changes, add admin token auth system, add route to get backend info
|
2023-09-24 15:54:35 -06:00 |
Cyberes
|
03e3ec5490
|
port to mysql, use vllm tokenizer endpoint
|
2023-09-20 20:30:31 -06:00 |
Cyberes
|
40ac84aa9a
|
actually we don't want to emulate openai
|
2023-09-12 01:04:11 -06:00 |
Cyberes
|
4c9d543eab
|
implement vllm backend
|
2023-09-11 20:47:19 -06:00 |
Cyberes
|
e45eafd286
|
update requirements.txt
|
2023-08-29 17:57:06 -06:00 |
Cyberes
|
bf648f605f
|
implement streaming for hf-textgen
|
2023-08-29 17:56:12 -06:00 |
Cyberes
|
1a4cb5f786
|
reorganize stats page again
|
2023-08-27 22:24:44 -06:00 |
Cyberes
|
33190e3cfe
|
fix stats for real
|
2023-08-23 01:14:19 -06:00 |
Cyberes
|
ad9a91f1b5
|
concurrent gens setting, online status
|
2023-08-22 00:26:46 -06:00 |
Cyberes
|
6e3ddab42e
|
fix relative paths for db path
|
2023-08-21 23:07:12 -06:00 |
Cyberes
|
8cbf643fd3
|
MVP
|
2023-08-21 21:28:52 -06:00 |