Cyberes
|
fe23a2282f
|
refactor, add Llm-Disable-Openai header
|
2024-05-07 17:41:53 -06:00 |
Cyberes
|
ab408c6c5b
|
ready for public release
|
2024-03-18 12:42:44 -06:00 |
Cyberes
|
4b3e0671c6
|
clean some stuff up, bump VLLM version
|
2024-01-10 15:01:26 -07:00 |
Cyberes
|
0e7f04ab2d
|
fix gunicorn logging
|
2023-12-21 14:24:50 -07:00 |
Cyberes
|
0059e7956c
|
Merge cluster to master (#3)
Co-authored-by: Cyberes <cyberes@evulid.cc>
Reviewed-on: #3
|
2023-10-27 19:19:22 -06:00 |
Cyberes
|
8d6b2ce49c
|
minor changes, add admin token auth system, add route to get backend info
|
2023-09-24 15:54:35 -06:00 |
Cyberes
|
57ccedcfb9
|
adjust some things
|
2023-09-12 01:10:58 -06:00 |
Cyberes
|
40ac84aa9a
|
actually we don't want to emulate openai
|
2023-09-12 01:04:11 -06:00 |
Cyberes
|
bf648f605f
|
implement streaming for hf-textgen
|
2023-08-29 17:56:12 -06:00 |
Cyberes
|
441a870e85
|
calculate weighted average for stat tracking
|
2023-08-27 19:58:04 -06:00 |
Cyberes
|
6a09ffc8a4
|
log model used in request so we can pull the correct averages when we change models
|
2023-08-26 00:30:59 -06:00 |
Cyberes
|
9b7bf490a1
|
sort keys of stats dict
|
2023-08-24 18:59:52 -06:00 |
Cyberes
|
c397649097
|
restyle homepage, add config item to add content to the home page
|
2023-08-24 17:55:55 -06:00 |
Cyberes
|
763dd832cc
|
update home, update readme, calculate estimated wait based on database stats
|
2023-08-24 16:47:14 -06:00 |
Cyberes
|
21174750ea
|
update readme
|
2023-08-24 12:19:59 -06:00 |
Cyberes
|
afc138c743
|
update readme
|
2023-08-24 00:09:57 -06:00 |
Cyberes
|
bf1842f434
|
update readme
|
2023-08-23 23:48:46 -06:00 |
Cyberes
|
7d23f95d26
|
use redis caching
|
2023-08-21 23:59:50 -06:00 |
Cyberes
|
8cbf643fd3
|
MVP
|
2023-08-21 21:28:52 -06:00 |
Cyberes
|
db0dfad83d
|
Initial commit
|
2023-08-21 14:40:46 -06:00 |