local-llm-server/llm_server/routes
Latest commit: 5f4e4710c1 by Cyberes, "option to prioritize by parameter count" (2023-10-04 10:19:44 -06:00)
Name                         Last commit date              Last commit message
helpers/                     2023-10-04 10:19:44 -06:00    option to prioritize by parameter count
openai/                      2023-10-02 20:53:08 -06:00    fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences
v1/                          2023-10-03 23:39:25 -06:00    show online status on stats page
__init__.py                  2023-08-24 20:43:11 -06:00    show total output tokens on stats
auth.py                      2023-09-26 22:09:11 -06:00    more work on openai endpoint
ooba_request_handler.py      2023-10-01 16:04:53 -06:00    finish openai endpoints
openai_request_handler.py    2023-10-03 13:40:08 -06:00    do default model rather than default backend, adjust moderation endpoint logic and add timeout, exclude system tokens from recent proompters, calculate number of moderators from endpoint concurrent gens, adjust homepage
queue.py                     2023-10-02 20:53:08 -06:00    fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences
request_handler.py           2023-10-03 13:49:00 -06:00    fix
server_error.py              2023-09-12 10:30:45 -06:00    fix invalid param error, add manual model name
stats.py                     2023-10-02 20:53:08 -06:00    fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences