This repository has been archived on 2024-10-27. You can view files and clone it, but cannot push or open issues or pull requests.
local-llm-server/llm_server
Cyberes 6e74ce7c28 fix old code in completions 2023-10-19 17:59:27 -06:00
..
cluster use backend handler to build parameters when sending test prompt 2023-10-17 12:42:48 -06:00
config fix the queue?? 2023-10-05 21:37:18 -06:00
database get streaming working again 2023-10-16 16:22:52 -06:00
llm remove timed-out items from queue 2023-10-17 11:46:39 -06:00
pages
routes fix old code in completions 2023-10-19 17:59:27 -06:00
workers refer to queue for tracking IP count rather than seperate value 2023-10-18 09:03:10 -06:00
__init__.py
custom_redis.py docs and stuff 2023-10-18 09:23:54 -06:00
helpers.py add length penalty param to vllm 2023-10-11 12:22:50 -06:00
messages.py trying to fix workers still processing after backend goes offline 2023-10-15 15:11:37 -06:00
opts.py don't pickle streaming 2023-10-16 18:35:10 -06:00
pre_fork.py
sock.py get streaming working again 2023-10-16 16:22:52 -06:00