This repository was archived on 2024-10-27 and is read-only: files can be viewed and cloned, but pushes, issues, and pull requests are disabled.
Directory: local-llm-server/llm_server/routes
Latest commit: 0771c2325c by Cyberes, "fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling" (2023-10-23 17:24:20 -06:00)
Name                      | Last commit                                                                                                       | Date
helpers/                  | fix keyerror?                                                                                                     | 2023-10-20 14:00:24 -06:00
openai/                   | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
v1/                       | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
__init__.py               | show total output tokens on stats                                                                                 | 2023-08-24 20:43:11 -06:00
auth.py                   | more work on openai endpoint                                                                                      | 2023-09-26 22:09:11 -06:00
ooba_request_handler.py   | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
openai_request_handler.py | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
queue.py                  | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
request_handler.py        | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
server_error.py           | fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling | 2023-10-23 17:24:20 -06:00
stats.py                  | fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences                | 2023-10-02 20:53:08 -06:00