Commit Graph

20 Commits

Author SHA1 Message Date
Cyberes fe23a2282f refactor, add Llm-Disable-Openai header 2024-05-07 17:41:53 -06:00
Cyberes ab408c6c5b ready for public release 2024-03-18 12:42:44 -06:00
Cyberes 4b3e0671c6 clean some stuff up, bump VLLM version 2024-01-10 15:01:26 -07:00
Cyberes 0e7f04ab2d fix gunicorn logging 2023-12-21 14:24:50 -07:00
Cyberes 0059e7956c Merge cluster to master (#3)
Co-authored-by: Cyberes <cyberes@evulid.cc>
Reviewed-on: #3
2023-10-27 19:19:22 -06:00
Cyberes 8d6b2ce49c minor changes, add admin token auth system, add route to get backend info 2023-09-24 15:54:35 -06:00
Cyberes 57ccedcfb9 adjust some things 2023-09-12 01:10:58 -06:00
Cyberes 40ac84aa9a actually we don't want to emulate openai 2023-09-12 01:04:11 -06:00
Cyberes bf648f605f implement streaming for hf-textgen 2023-08-29 17:56:12 -06:00
Cyberes 441a870e85 calculate weighted average for stat tracking 2023-08-27 19:58:04 -06:00
Cyberes 6a09ffc8a4 log model used in request so we can pull the correct averages when we change models 2023-08-26 00:30:59 -06:00
Cyberes 9b7bf490a1 sort keys of stats dict 2023-08-24 18:59:52 -06:00
Cyberes c397649097 restyle homepage, add config item to add content to the home page 2023-08-24 17:55:55 -06:00
Cyberes 763dd832cc update home, update readme, calculate estimated wait based on database stats 2023-08-24 16:47:14 -06:00
Cyberes 21174750ea update readme 2023-08-24 12:19:59 -06:00
Cyberes afc138c743 update readme 2023-08-24 00:09:57 -06:00
Cyberes bf1842f434 update readme 2023-08-23 23:48:46 -06:00
Cyberes 7d23f95d26 use redis caching 2023-08-21 23:59:50 -06:00
Cyberes 8cbf643fd3 MVP 2023-08-21 21:28:52 -06:00
Cyberes db0dfad83d Initial commit 2023-08-21 14:40:46 -06:00