This repository was archived on 2024-10-27 and is read-only: files can be viewed and cloned, but pushes, issues, and pull requests are disabled.
local-llm-server/llm_server/llm
Latest commit 1646a00987 (Cyberes, 2023-09-25 12:30:40 -06:00): implement streaming on openai, improve streaming, run DB logging in background thread
oobabooga/      further align openai endpoint with expected responses (2023-09-24 21:45:30 -06:00)
vllm/           implement streaming on openai, improve streaming, run DB logging in background thread (2023-09-25 12:30:40 -06:00)
__init__.py     rewrite tokenizer, restructure validation (2023-09-24 13:02:30 -06:00)
generator.py    remove text-generation-inference backend (2023-09-12 13:09:47 -06:00)
info.py         option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup (2023-09-14 14:05:50 -06:00)
llm_backend.py  rewrite tokenizer, restructure validation (2023-09-24 13:02:30 -06:00)