Commit Graph

21 Commits

Author SHA1 Message Date
Cyberes fd09c783d3 refactor a lot of things, major cleanup, use postgresql 2024-05-07 17:03:41 -06:00
Cyberes ee9a0d4858 redo config 2024-05-07 12:20:53 -06:00
Cyberes ff82add09e redo database connection, add pooling, minor logging changes, other clean up 2024-05-07 09:48:51 -06:00
Cyberes ab408c6c5b ready for public release 2024-03-18 12:42:44 -06:00
Cyberes 4b3e0671c6 clean some stuff up, bump VLLM version 2024-01-10 15:01:26 -07:00
Cyberes 0059e7956c Merge cluster to master (#3)
Co-authored-by: Cyberes <cyberes@evulid.cc>
Reviewed-on: #3
2023-10-27 19:19:22 -06:00
Cyberes e86a5182eb redo background processes, reorganize server.py 2023-09-27 23:36:44 -06:00
Cyberes d9bbcc42e6 more work on openai endpoint 2023-09-26 22:09:11 -06:00
Cyberes d2651756df update requirements.txt 2023-09-24 21:46:48 -06:00
Cyberes 84ea2f8891 handle when auth token is not enabled 2023-09-24 15:57:39 -06:00
Cyberes 8d6b2ce49c minor changes, add admin token auth system, add route to get backend info 2023-09-24 15:54:35 -06:00
Cyberes 03e3ec5490 port to mysql, use vllm tokenizer endpoint 2023-09-20 20:30:31 -06:00
Cyberes 40ac84aa9a actually we don't want to emulate openai 2023-09-12 01:04:11 -06:00
Cyberes 4c9d543eab implement vllm backend 2023-09-11 20:47:19 -06:00
Cyberes e45eafd286 update requirements.txt 2023-08-29 17:57:06 -06:00
Cyberes bf648f605f implement streaming for hf-textgen 2023-08-29 17:56:12 -06:00
Cyberes 1a4cb5f786 reorganize stats page again 2023-08-27 22:24:44 -06:00
Cyberes 33190e3cfe fix stats for real 2023-08-23 01:14:19 -06:00
Cyberes ad9a91f1b5 concurrent gens setting, online status 2023-08-22 00:26:46 -06:00
Cyberes 6e3ddab42e fix relative paths for db path 2023-08-21 23:07:12 -06:00
Cyberes 8cbf643fd3 MVP 2023-08-21 21:28:52 -06:00