This repository has been archived on 2024-10-27. You can view files and clone it, but cannot push or open issues or pull requests.
local-llm-server/llm_server
Cyberes 2d8812a6cd fix crash again 2023-08-31 09:31:16 -06:00
..
llm still having issues 2023-08-31 09:24:37 -06:00
pages update current model when we generate_stats() 2023-08-24 21:10:00 -06:00
routes fix crash again 2023-08-31 09:31:16 -06:00
__init__.py MVP 2023-08-21 21:28:52 -06:00
config.py refactor generation route 2023-08-30 18:53:26 -06:00
database.py implement streaming for hf-textgen 2023-08-29 17:56:12 -06:00
helpers.py add HF text-generation-inference backend 2023-08-29 13:46:41 -06:00
integer.py MVP 2023-08-21 21:28:52 -06:00
netdata.py reorganize nvidia stats 2023-08-25 15:02:40 -06:00
opts.py refactor generation route 2023-08-30 18:53:26 -06:00
stream.py implement streaming for hf-textgen 2023-08-29 17:56:12 -06:00
threads.py update weighted_average_column_for_model to account for when there was an error reported, insert null for response tokens when error, correctly parse x-forwarded-for, correctly convert model reported by hf-textgen 2023-08-29 15:46:56 -06:00