This repository was archived on 2024-10-27. You can view and clone its files, but you cannot push to it or open issues or pull requests.
local-llm-server/llm_server/routes
Latest commit: 50377eca22 by Cyberes, "track lag on get_ip_request_count()", 2023-10-18 09:09:22 -06:00
Name                       Last commit date            Last commit message
helpers                    2023-10-04 10:19:44 -06:00  option to prioritize by parameter count
openai                     2023-10-17 12:32:41 -06:00  test
v1                         2023-10-17 11:46:39 -06:00  remove timed-out items from queue
__init__.py                2023-08-24 20:43:11 -06:00  show total output tokens on stats
auth.py                    2023-09-26 22:09:11 -06:00  more work on openai endpoint
ooba_request_handler.py    2023-10-15 20:45:01 -06:00  fix issues with queue and streaming
openai_request_handler.py  2023-10-16 16:22:52 -06:00  get streaming working again
queue.py                   2023-10-18 09:09:22 -06:00  track lag on get_ip_request_count()
request_handler.py         2023-10-17 11:46:39 -06:00  remove timed-out items from queue
server_error.py            2023-09-12 10:30:45 -06:00  fix invalid param error, add manual model name
stats.py                   2023-10-02 20:53:08 -06:00  fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences