local-llm-server/llm_server/llm
Cyberes aba2e5b9c0 don't use db pooling, add LLM-ST-Errors header to disable formatted errors 2023-09-26 23:59:22 -06:00
..
oobabooga further align openai endpoint with expected responses 2023-09-24 21:45:30 -06:00
openai don't use db pooling, add LLM-ST-Errors header to disable formatted errors 2023-09-26 23:59:22 -06:00
vllm don't use db pooling, add LLM-ST-Errors header to disable formatted errors 2023-09-26 23:59:22 -06:00
__init__.py more work on openai endpoint 2023-09-26 22:09:11 -06:00
generator.py remove text-generation-inference backend 2023-09-12 13:09:47 -06:00
info.py option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup 2023-09-14 14:05:50 -06:00
llm_backend.py rewrite tokenizer, restructure validation 2023-09-24 13:02:30 -06:00