This repository has been archived on 2024-10-27. You can view files and clone it, but cannot push or open issues or pull requests.
local-llm-server/llm_server
Latest commit: 94e845cd1a by Cyberes, "if there's less than num concurrent wait time is 0", 2023-09-23 21:09:21 -06:00 (a sketch of the wait-time idea behind this message follows the listing).
Name           Last commit date              Last commit message
database/      2023-09-23 20:55:49 -06:00    fix two exceptions
llm/           2023-09-23 17:57:23 -06:00    implement streaming for vllm
pages/         2023-08-24 21:10:00 -06:00    update current model when we generate_stats()
routes/        2023-09-23 21:09:21 -06:00    if there's less than num concurrent wait time is 0
__init__.py    2023-08-21 21:28:52 -06:00    MVP
config.py      2023-09-14 15:07:17 -06:00    add moderation endpoint to openai api, update config
helpers.py     2023-09-20 21:14:12 -06:00    fix json error
integer.py     2023-08-21 21:28:52 -06:00    MVP
netdata.py     2023-09-14 14:05:50 -06:00    option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup
opts.py        2023-09-20 20:30:31 -06:00    port to mysql, use vllm tokenizer endpoint
stream.py      2023-08-29 17:56:12 -06:00    implement streaming for hf-textgen
threads.py     2023-09-23 20:55:49 -06:00    fix two exceptions
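
The latest commit message ("if there's less than num concurrent wait time is 0") describes a small piece of queue logic: when fewer requests are waiting than there are concurrent generation slots, the estimated wait time reported to clients should be zero. The snippet below is only a minimal sketch of that idea under those assumptions, not the repository's actual code (which lives in llm_server/routes); the function name estimate_wait_time and its parameters are hypothetical.

```python
# Minimal, hypothetical sketch of the wait-time idea in the latest commit
# message. The real implementation lives in llm_server/routes; none of the
# names below are taken from the repository.

def estimate_wait_time(queued_requests: int,
                       concurrent_gens: int,
                       average_generation_time: float) -> float:
    """Estimate how long a newly queued request will wait.

    If fewer requests are queued than there are concurrent generation
    slots, the new request can start immediately, so the wait is zero.
    Otherwise, assume roughly one average generation time per full
    batch of requests ahead in the queue.
    """
    if queued_requests < concurrent_gens:
        return 0.0
    batches_ahead = queued_requests // concurrent_gens
    return batches_ahead * average_generation_time


if __name__ == "__main__":
    print(estimate_wait_time(2, 3, 30.0))   # 0.0  -> starts immediately
    print(estimate_wait_time(10, 3, 30.0))  # 90.0 -> about three batches ahead
```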