This repository has been archived on 2024-10-27. You can view files and clone it, but cannot push or open issues or pull requests.
local-llm-server/llm_server
Cyberes 11e84db59c update database, tokenizer handle null prompt, convert top_p to vllm on openai, actually validate prompt on streaming, 2023-09-25 22:32:48 -06:00
..
database update database, tokenizer handle null prompt, convert top_p to vllm on openai, actually validate prompt on streaming, 2023-09-25 22:32:48 -06:00
llm update database, tokenizer handle null prompt, convert top_p to vllm on openai, actually validate prompt on streaming, 2023-09-25 22:32:48 -06:00
pages update current model when we generate_stats() 2023-08-24 21:10:00 -06:00
routes update database, tokenizer handle null prompt, convert top_p to vllm on openai, actually validate prompt on streaming, 2023-09-25 22:32:48 -06:00
__init__.py MVP 2023-08-21 21:28:52 -06:00
config.py openai_force_no_hashes 2023-09-25 22:01:57 -06:00
helpers.py fix homepage slowness, fix incorrect 24 hr prompters, fix redis wrapper, 2023-09-25 17:20:21 -06:00
integer.py MVP 2023-08-21 21:28:52 -06:00
netdata.py option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup 2023-09-14 14:05:50 -06:00
opts.py openai_force_no_hashes 2023-09-25 22:01:57 -06:00
stream.py implement streaming for hf-textgen 2023-08-29 17:56:12 -06:00
threads.py fix homepage slowness, fix incorrect 24 hr prompters, fix redis wrapper, 2023-09-25 17:20:21 -06:00