local-llm-server/llm_server/routes
Latest commit: 0015e653b2 ("adjust a few final things") by Cyberes, 2023-09-23 22:30:59 -06:00
Name                       Last commit                                                        Date
helpers/                   implement streaming for vllm                                       2023-09-23 17:57:23 -06:00
openai/                    cache stats in background                                          2023-09-17 18:55:36 -06:00
v1/                        adjust a few final things                                          2023-09-23 22:30:59 -06:00
__init__.py                show total output tokens on stats                                  2023-08-24 20:43:11 -06:00
cache.py                   check if the backend crapped out, print some more stuff            2023-09-14 14:26:25 -06:00
ooba_request_handler.py    port to mysql, use vllm tokenizer endpoint                         2023-09-20 20:30:31 -06:00
openai_request_handler.py  port to mysql, use vllm tokenizer endpoint                         2023-09-20 20:30:31 -06:00
queue.py                   set up queue to work with gunicorn processes, other improvements   2023-09-14 17:38:20 -06:00
request_handler.py         implement streaming for vllm                                       2023-09-23 17:57:23 -06:00
server_error.py            fix invalid param error, add manual model name                     2023-09-12 10:30:45 -06:00
stats.py                   change proompters 1 min to 5 min                                   2023-09-20 21:21:22 -06:00
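The commit messages hint at the server's architecture. The queue.py entry, "set up queue to work with gunicorn processes", points at a common constraint: gunicorn forks independent worker processes that share no Python memory, so a request queue cannot live inside any one worker and has to sit in shared storage. Below is a minimal sketch of that pattern using Redis; the class, key, and parameter names are hypothetical illustrations and are not taken from this repository's queue.py.

    # Hypothetical sketch only -- not the actual queue.py from this repo.
    # A FIFO request queue kept in Redis so that every gunicorn worker
    # process reads from and writes to the same queue.
    import json

    import redis


    class CrossProcessQueue:
        def __init__(self, name: str = "llm_requests",
                     host: str = "localhost", port: int = 6379):
            self.name = name
            self.redis = redis.Redis(host=host, port=port, decode_responses=True)

        def put(self, item: dict) -> None:
            # LPUSH on the producer side...
            self.redis.lpush(self.name, json.dumps(item))

        def get(self, timeout: int = 0) -> dict | None:
            # ...BRPOP on the consumer side gives FIFO order across processes.
            # timeout=0 blocks until an item arrives; a positive timeout
            # returns None if nothing shows up in time.
            popped = self.redis.brpop(self.name, timeout=timeout)
            if popped is None:
                return None
            _key, raw = popped
            return json.loads(raw)

        def __len__(self) -> int:
            return self.redis.llen(self.name)

With this shape, any worker can enqueue an incoming request with put(), and a single background consumer can drain the queue with get(), regardless of which gunicorn process handled the original HTTP request.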