local-llm-server/llm_server/routes
Latest commit: 0015e653b2 ("adjust a few final things") by Cyberes, 2023-09-23 22:30:59 -06:00
Name                       Last commit                                                        Date
helpers/                   implement streaming for vllm                                       2023-09-23 17:57:23 -06:00
openai/                    cache stats in background                                          2023-09-17 18:55:36 -06:00
v1/                        adjust a few final things                                          2023-09-23 22:30:59 -06:00
__init__.py                show total output tokens on stats                                  2023-08-24 20:43:11 -06:00
cache.py                   check if the backend crapped out, print some more stuff            2023-09-14 14:26:25 -06:00
ooba_request_handler.py    port to mysql, use vllm tokenizer endpoint                         2023-09-20 20:30:31 -06:00
openai_request_handler.py  port to mysql, use vllm tokenizer endpoint                         2023-09-20 20:30:31 -06:00
queue.py                   set up queue to work with gunicorn processes, other improvements   2023-09-14 17:38:20 -06:00
request_handler.py         implement streaming for vllm                                       2023-09-23 17:57:23 -06:00
server_error.py            fix invalid param error, add manual model name                     2023-09-12 10:30:45 -06:00
stats.py                   change proompters 1 min to 5 min                                   2023-09-20 21:21:22 -06:00
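The commit messages hint at the server's architecture. The queue.py entry, "set up queue to work with gunicorn processes", points at a common constraint: gunicorn forks independent worker processes that share no Python memory, so a request queue cannot live inside any one worker and has to sit in shared storage. Below is a minimal sketch of that pattern using Redis; the class, key, and parameter names are hypothetical illustrations and are not taken from this repository's queue.py.

    # Hypothetical sketch only -- not the actual queue.py from this repo.
    # A FIFO request queue kept in Redis so that every gunicorn worker
    # process reads from and writes to the same queue.
    import json

    import redis


    class CrossProcessQueue:
        def __init__(self, name: str = "llm_requests",
                     host: str = "localhost", port: int = 6379):
            self.name = name
            self.redis = redis.Redis(host=host, port=port, decode_responses=True)

        def put(self, item: dict) -> None:
            # LPUSH on the producer side...
            self.redis.lpush(self.name, json.dumps(item))

        def get(self, timeout: int = 0) -> dict | None:
            # ...BRPOP on the consumer side gives FIFO order across processes.
            # timeout=0 blocks until an item arrives; a positive timeout
            # returns None if nothing shows up in time.
            popped = self.redis.brpop(self.name, timeout=timeout)
            if popped is None:
                return None
            _key, raw = popped
            return json.loads(raw)

        def __len__(self) -> int:
            return self.redis.llen(self.name)

With this shape, any worker can enqueue an incoming request with put(), and a single background consumer can drain the queue with get(), regardless of which gunicorn process handled the original HTTP request.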