local-llm-server

History

Cyberes aba2e5b9c0 don't use db pooling, add LLM-ST-Errors header to disable formatted errors		2023-09-26 23:59:22 -06:00
..
oobabooga	further align openai endpoint with expected responses	2023-09-24 21:45:30 -06:00
openai	don't use db pooling, add LLM-ST-Errors header to disable formatted errors	2023-09-26 23:59:22 -06:00
vllm	don't use db pooling, add LLM-ST-Errors header to disable formatted errors	2023-09-26 23:59:22 -06:00
__init__.py	more work on openai endpoint	2023-09-26 22:09:11 -06:00
generator.py	remove text-generation-inference backend	2023-09-12 13:09:47 -06:00
info.py	option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup	2023-09-14 14:05:50 -06:00
llm_backend.py	rewrite tokenizer, restructure validation	2023-09-24 13:02:30 -06:00