This repository has been archived on 2024-10-27. You can view files and clone it, but cannot push or open issues or pull requests.
local-llm-server/llm_server/routes
Cyberes 4f5b2dbecb add tests 2023-10-20 12:14:34 -06:00
..
helpers option to prioritize by parameter count 2023-10-04 10:19:44 -06:00
openai add tests 2023-10-20 12:14:34 -06:00
v1 remove timed-out items from queue 2023-10-17 11:46:39 -06:00
__init__.py show total output tokens on stats 2023-08-24 20:43:11 -06:00
auth.py more work on openai endpoint 2023-09-26 22:09:11 -06:00
ooba_request_handler.py fix issues with queue and streaming 2023-10-15 20:45:01 -06:00
openai_request_handler.py get streaming working again 2023-10-16 16:22:52 -06:00
queue.py docs and stuff 2023-10-18 09:23:54 -06:00
request_handler.py change print 2023-10-18 09:12:30 -06:00
server_error.py docs and stuff 2023-10-18 09:23:54 -06:00
stats.py fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences 2023-10-02 20:53:08 -06:00