This repository was archived on 2024-10-27. You can view and clone its files, but you cannot push or open issues or pull requests.
local-llm-server/llm_server
Latest commit: e1d3fca6d3 by Cyberes, "try to cancel inference if disconnected from client" (2023-09-28 09:55:31 -06:00)
Name        | Last commit message                                                                                                                 | Last commit date
config/     | rewrite redis usage                                                                                                                 | 2023-09-28 03:44:30 -06:00
database/   | fix double logging                                                                                                                  | 2023-09-28 01:34:15 -06:00
llm/        | avoid sending to backend to tokenize if it's greater than our specified context size                                                | 2023-09-28 03:54:20 -06:00
pages/      | update current model when we generate_stats()                                                                                       | 2023-08-24 21:10:00 -06:00
routes/     | try to cancel inference if disconnected from client                                                                                 | 2023-09-28 09:55:31 -06:00
workers/    | fix negative queue on stats                                                                                                         | 2023-09-28 08:47:39 -06:00
__init__.py | MVP                                                                                                                                 | 2023-08-21 21:28:52 -06:00
helpers.py  | divide by 0???                                                                                                                      | 2023-09-28 03:46:01 -06:00
integer.py  | MVP                                                                                                                                 | 2023-08-21 21:28:52 -06:00
netdata.py  | option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup  | 2023-09-14 14:05:50 -06:00
opts.py     | more work on openai endpoint                                                                                                        | 2023-09-26 22:09:11 -06:00
pre_fork.py | redo background processes, reorganize server.py                                                                                     | 2023-09-27 23:36:44 -06:00
stream.py   | implement streaming for hf-textgen                                                                                                  | 2023-08-29 17:56:12 -06:00