local-llm-server

Author	SHA1	Message	Date
Cyberes	e7b57cad7b	set up cluster config and basic background workers	2023-09-28 18:40:24 -06:00
Cyberes	d9bbcc42e6	more work on openai endpoint	2023-09-26 22:09:11 -06:00
Cyberes	8d6b2ce49c	minor changes, add admin token auth system, add route to get backend info	2023-09-24 15:54:35 -06:00
Cyberes	12e894032e	show the openai system prompt	2023-09-13 20:25:56 -06:00
Cyberes	6152b1bb66	fix invalid param error, add manual model name	2023-09-12 10:30:45 -06:00
Cyberes	40ac84aa9a	actually we don't want to emulate openai	2023-09-12 01:04:11 -06:00
Cyberes	4c9d543eab	implement vllm backend	2023-09-11 20:47:19 -06:00
Cyberes	2816c01902	refactor generation route	2023-08-30 18:53:26 -06:00
Cyberes	a79d67adbb	do caching ourself on /model	2023-08-23 16:40:20 -06:00
Cyberes	64e1b1654f	more cloudflare finicky stuff	2023-08-23 16:32:13 -06:00
Cyberes	f76d7bbc5d	more caching stuff	2023-08-23 16:23:24 -06:00
Cyberes	a6b0bb0183	actually we want 500	2023-08-23 16:09:36 -06:00
Cyberes	508089ce11	model info timeout and additional info	2023-08-23 16:07:43 -06:00
Cyberes	1f5e2da637	print fetch model error message	2023-08-23 16:02:57 -06:00
Cyberes	806073ee4c	update config	2023-08-23 15:23:06 -06:00
Cyberes	ba063f7f1b	caching	2023-08-23 12:40:13 -06:00
Cyberes	0d32db2dbd	prototype hf-textgen and adjust logging	2023-08-22 19:58:31 -06:00
Cyberes	e04d6a8a13	minor adjustments	2023-08-21 22:49:44 -06:00
Cyberes	8cbf643fd3	MVP	2023-08-21 21:28:52 -06:00