local-llm-server/llm_server/workers
Last commit: 563630547a add robots.txt (Cyberes, 2023-10-23 17:32:33 -06:00)
__init__.py: add ratelimiting to websocket streaming endpoint, fix queue not decrementing IP requests, add console printer (2023-09-27 21:15:54 -06:00)
cleaner.py: begin streaming rewrite (2023-10-16 00:18:05 -06:00)
inferencer.py: add robots.txt (2023-10-23 17:32:33 -06:00)
logger.py: clean up (2023-10-11 18:04:15 -06:00)
mainer.py: remove timed-out items from queue (2023-10-17 11:46:39 -06:00)
moderator.py: fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling (2023-10-23 17:24:20 -06:00)
printer.py: fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling (2023-10-23 17:24:20 -06:00)
recenter.py: mvp (2023-09-29 00:09:44 -06:00)
threader.py: fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling (2023-10-23 17:24:20 -06:00)