This repository was archived on 2024-10-27. You can view files and clone it, but you cannot push, open issues, or open pull requests.
local-llm-server/llm_server/workers
Cyberes f88e2362c5 remove some debug prints 2023-10-03 20:01:28 -06:00
__init__.py add ratelimiting to websocket streaming endpoint, fix queue not decrementing IP requests, add console printer 2023-09-27 21:15:54 -06:00
inferencer.py fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences 2023-10-02 20:53:08 -06:00
mainer.py cache the home page in the background 2023-09-30 23:03:42 -06:00
moderator.py remove some debug prints 2023-10-03 20:01:28 -06:00
printer.py functional 2023-09-30 19:41:50 -06:00
recenter.py mvp 2023-09-29 00:09:44 -06:00
threader.py do default model rather than default backend, adjust moderation endpoint logic and add timeout, exclude system tokens from recent proompters, calculate number of moderators from endpoint concurrent gens, adjust homepage 2023-10-03 13:40:08 -06:00