This repository was archived on 2024-10-27 and is read-only: files can be viewed and cloned, but pushes, issues, and pull requests are disabled.
local-llm-server/llm_server/llm
Latest commit 1646a00987 (Cyberes, 2023-09-25 12:30:40 -06:00): implement streaming on openai, improve streaming, run DB logging in background thread
oobabooga/      further align openai endpoint with expected responses (2023-09-24 21:45:30 -06:00)
vllm/           implement streaming on openai, improve streaming, run DB logging in background thread (2023-09-25 12:30:40 -06:00)
__init__.py     rewrite tokenizer, restructure validation (2023-09-24 13:02:30 -06:00)
generator.py    remove text-generation-inference backend (2023-09-12 13:09:47 -06:00)
info.py         option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup (2023-09-14 14:05:50 -06:00)
llm_backend.py  rewrite tokenizer, restructure validation (2023-09-24 13:02:30 -06:00)