This repository has been archived on 2024-10-27. You can view files and clone it, but cannot push or open issues or pull requests.
local-llm-server/llm_server
Cyberes e1d3fca6d3 try to cancel inference if disconnected from client 2023-09-28 09:55:31 -06:00
..
config rewrite redis usage 2023-09-28 03:44:30 -06:00
database fix double logging 2023-09-28 01:34:15 -06:00
llm avoid sending to backend to tokenize if it's greater than our specified context size 2023-09-28 03:54:20 -06:00
pages
routes try to cancel inference if disconnected from client 2023-09-28 09:55:31 -06:00
workers fix negative queue on stats 2023-09-28 08:47:39 -06:00
__init__.py
helpers.py divide by 0??? 2023-09-28 03:46:01 -06:00
integer.py
netdata.py
opts.py
pre_fork.py
stream.py