Merge cluster to master #3

Merged
cyberes merged 163 commits from cluster into master 2023-10-27 19:19:22 -06:00

163 Commits

Author SHA1 Message Date
Cyberes ee44371fdf Merge branch 'master' into cluster 2023-10-27 19:05:27 -06:00
Cyberes 28c250385d add todo 2023-10-27 19:00:49 -06:00
Cyberes 563630547a add robots.txt 2023-10-23 17:32:33 -06:00
Cyberes b4e01e129d fix when all offline 2023-10-23 17:28:59 -06:00
Cyberes 96ba48affc make sure to regen stats on startup 2023-10-23 17:26:15 -06:00
Cyberes 177dabd209 Give some time for the background threads to get themselves ready to go 2023-10-23 17:25:38 -06:00
Cyberes 0771c2325c fix inference workers quitting when a backend is offline, start adding logging, improve tokenizer error handling 2023-10-23 17:24:20 -06:00
Cyberes 3cf73fec9b fix a few exceptions when all backends go offline 2023-10-23 15:22:57 -06:00
Cyberes d43f110a14 fix redis cycle and add no reset to daemon 2023-10-22 12:19:20 -06:00
Cyberes e236e93a79 clean up a bit 2023-10-20 18:26:41 -06:00
Cyberes f39e976b34 dameon printer: Calculate the queue size the same way it's done on the stats 2023-10-20 17:41:47 -06:00
Cyberes 1a15232400 tests: make sure all prompts are the same 2023-10-20 17:35:21 -06:00
Cyberes 763139c949 fix keyerror 2023-10-20 14:02:30 -06:00
Cyberes e838f591aa fix keyerror? 2023-10-20 14:00:24 -06:00
Cyberes 0abd4b94fb track down keyerror 2023-10-20 13:57:20 -06:00
Cyberes 4f5b2dbecb add tests 2023-10-20 12:14:34 -06:00
Cyberes 6e74ce7c28 fix old code in completions 2023-10-19 17:59:27 -06:00
Cyberes b9566e9db7 docs and stuff 2023-10-18 09:23:54 -06:00
Cyberes 56a2ca464b change print 2023-10-18 09:12:30 -06:00
Cyberes 50377eca22 track lag on get_ip_request_count() 2023-10-18 09:09:22 -06:00
Cyberes 92e4ecd8a1 refer to queue for tracking IP count rather than seperate value 2023-10-18 09:03:10 -06:00
Cyberes be03569165 use backend handler to build parameters when sending test prompt 2023-10-17 12:42:48 -06:00
Cyberes 90adffaec8 test 2023-10-17 12:32:41 -06:00
Cyberes 4c2c164ce1 test 2023-10-17 12:29:12 -06:00
Cyberes 2fed87d340 remove timed-out items from queue 2023-10-17 11:46:39 -06:00
Cyberes 7998cfca87 cleanup 2023-10-16 23:47:34 -06:00
Cyberes 2ed0e01db6 background thread 2023-10-16 23:44:11 -06:00
Cyberes 6f65791795 adjust 2023-10-16 23:40:07 -06:00
Cyberes 9e3cbc9d2e fix streaming slowdown? 2023-10-16 23:36:25 -06:00
Cyberes c3c053e071 test 2023-10-16 23:29:17 -06:00
Cyberes 70cf6843e5 update requiorements 2023-10-16 23:12:41 -06:00
Cyberes 806e522d16 don't pickle streaming 2023-10-16 18:35:10 -06:00
Cyberes 81baf9616f revert 2023-10-16 18:16:19 -06:00
Cyberes 21755450a3 test 2023-10-16 18:10:21 -06:00
Cyberes 1e68e10b62 fix GeneratorExit 2023-10-16 18:04:49 -06:00
Cyberes 20047fa0e4 2000 chunk size 2023-10-16 18:01:17 -06:00
Cyberes 19a193b792 increase tokenization chunk size 2023-10-16 17:59:21 -06:00
Cyberes f421436048 add nginx config 2023-10-16 17:43:37 -06:00
Cyberes 2c7773cc4f get streaming working again 2023-10-16 16:22:52 -06:00
Cyberes 151b3e4769 begin streaming rewrite 2023-10-16 00:18:05 -06:00
Cyberes 24aab3cd93 fix streaming disabled 2023-10-15 20:59:11 -06:00
Cyberes 381bdb950f remove debug print 2023-10-15 20:46:32 -06:00
Cyberes 31ab4188f1 fix issues with queue and streaming 2023-10-15 20:45:01 -06:00
Cyberes 3ec9b2347f fix wrong datatype 2023-10-15 17:24:18 -06:00
Cyberes b3f0c4b28f remove debug print 2023-10-15 15:14:32 -06:00
Cyberes 83f3ba8919 trying to fix workers still processing after backend goes offline 2023-10-15 15:11:37 -06:00
Cyberes ca7044bc90 update gradio chat 2023-10-11 18:49:31 -06:00
Cyberes 4e3985e156 fix wrong status code on openai streaming 2023-10-11 18:17:02 -06:00
Cyberes 74cf8f309b clean up 2023-10-11 18:04:15 -06:00
Cyberes 169e216a38 add background thread to gradio 2023-10-11 12:56:04 -06:00
Cyberes 69b8c1e35c fix openai confusion 2023-10-11 12:50:20 -06:00
Cyberes 1d1c45dc1a add length penalty param to vllm 2023-10-11 12:22:50 -06:00
Cyberes 78114771b0 fix oai exception 2023-10-11 09:20:00 -06:00
Cyberes 7286e38cb0 t 2023-10-11 09:10:32 -06:00
Cyberes f4e5b5275d test 2023-10-11 09:09:41 -06:00
Cyberes 18e37a72ae add model selection to openai endpoint 2023-10-09 23:51:26 -06:00
Cyberes 5f7bf4faca misc changes 2023-10-09 18:12:12 -06:00
Cyberes ae4d4e5ca9 fix exception 2023-10-09 10:31:35 -06:00
Cyberes 467e1893ea fix issue with null data on openai 2023-10-08 19:36:12 -06:00
Cyberes 3e5feb9c97 fix stat 2023-10-05 21:43:49 -06:00
Cyberes e8964fcfd2 fix the queue?? 2023-10-05 21:37:18 -06:00
Cyberes ea61766838 fix 2023-10-05 20:19:57 -06:00
Cyberes da20d1807b actually wait again 2023-10-05 20:17:42 -06:00
Cyberes e9f6fdf65e fix streaming? 2023-10-05 20:14:28 -06:00
Cyberes 67173f30dd t 2023-10-05 19:35:12 -06:00
Cyberes 8df667bc0a t 2023-10-05 19:25:08 -06:00
Cyberes c4cc7bbaa0 f 2023-10-05 19:13:43 -06:00
Cyberes 0bef14ea55 t 2023-10-05 19:11:59 -06:00
Cyberes 5540112607 t 2023-10-05 19:09:25 -06:00
Cyberes 9befda5acb c 2023-10-05 19:05:41 -06:00
Cyberes 50992116f5 fix 2023-10-05 18:59:19 -06:00
Cyberes 96dd62478f fix 2023-10-05 18:53:19 -06:00
Cyberes a37b12a221 t 2023-10-05 18:51:56 -06:00
Cyberes 46d44f95ac t 2023-10-05 18:50:10 -06:00
Cyberes 817c454c89 t 2023-10-05 18:48:07 -06:00
Cyberes 9b819573e8 fix import error 2023-10-05 18:39:31 -06:00
Cyberes e07e31df0a fix 2023-10-05 18:37:50 -06:00
Cyberes 0718f10eb9 t 2023-10-05 18:36:02 -06:00
Cyberes fb8bc05b4c t 2023-10-05 18:30:54 -06:00
Cyberes 6be1e9acd3 t 2023-10-05 18:16:17 -06:00
Cyberes 10eb6269b7 t 2023-10-05 18:15:02 -06:00
Cyberes 64d7a9edbb fix 2023-10-05 18:09:24 -06:00
Cyberes 5a61bdccd4 f 2023-10-05 18:07:59 -06:00
Cyberes 3d0a5cf0a2 t 2023-10-05 18:06:36 -06:00
Cyberes 01fb619b9b f 2023-10-05 18:05:31 -06:00
Cyberes a229b4d6c5 c 2023-10-05 18:04:29 -06:00
Cyberes a53790ee37 fix??? 2023-10-05 18:01:08 -06:00
Cyberes 84c1ed8737 t 2023-10-05 17:58:23 -06:00
Cyberes 979a945466 t 2023-10-05 17:55:44 -06:00
Cyberes 19e62be3e8 t 2023-10-05 17:41:01 -06:00
Cyberes 27e461c76b test 2023-10-05 17:00:35 -06:00
Cyberes 08df52a4fd fix exception when not valid model 2023-10-05 12:28:00 -06:00
Cyberes acf409abfc fix background logger, add gradio chat example 2023-10-04 19:24:47 -06:00
Cyberes 1670594908 fix import error 2023-10-04 16:29:19 -06:00
Cyberes 6723dd79dc fix exceptoin 2023-10-04 16:04:03 -06:00
Cyberes 09fa69e031 fix 2023-10-04 13:37:39 -06:00
Cyberes 62d5d43da4 handle backend offline in tokenizer 2023-10-04 13:34:59 -06:00
Cyberes 7acaa3c885 g 2023-10-04 13:32:54 -06:00
Cyberes d78ef652fc c 2023-10-04 13:21:43 -06:00
Cyberes 5e90fa54d4 handle model offline 2023-10-04 13:18:47 -06:00
Cyberes 754a4cbdf3 r 2023-10-04 13:11:43 -06:00
Cyberes d0eec88dbd f 2023-10-04 13:03:58 -06:00
Cyberes 6bad5b3fa0 t 2023-10-04 13:02:53 -06:00
Cyberes 77db34a6a7 g 2023-10-04 12:59:19 -06:00
Cyberes 364b795268 fix 2023-10-04 12:57:11 -06:00
Cyberes 7cb624c5f5 f 2023-10-04 12:47:59 -06:00
Cyberes 6af5365015 c 2023-10-04 12:45:20 -06:00
Cyberes f3a13fcda8 c 2023-10-04 12:44:33 -06:00
Cyberes a15b5465df c 2023-10-04 12:44:09 -06:00
Cyberes 95d781725e t 2023-10-04 12:42:18 -06:00
Cyberes 1b21cb69c1 test 2023-10-04 12:40:29 -06:00
Cyberes 4deb32bf1c test 2023-10-04 10:32:11 -06:00
Cyberes 7e3af3599d test 2023-10-04 10:29:58 -06:00
Cyberes 4634e36eeb text 2023-10-04 10:26:39 -06:00
Cyberes b76e77a66a fix exception 2023-10-04 10:24:28 -06:00
Cyberes 5f4e4710c1 option to prioritize by parameter count 2023-10-04 10:19:44 -06:00
Cyberes 6dc3529190 show online status on stats page 2023-10-03 23:39:25 -06:00
Cyberes 1a7f22ec55 adjust again 2023-10-03 20:47:37 -06:00
Cyberes 67f5df9bb9 fix stats page 2023-10-03 20:42:53 -06:00
Cyberes f88e2362c5 remove some debug prints 2023-10-03 20:01:28 -06:00
Cyberes 33b4b8404b clean up streaming 2023-10-03 14:10:50 -06:00
Cyberes e16f415749 fix 2023-10-03 13:49:00 -06:00
Cyberes 581a0fec99 fix exception 2023-10-03 13:47:18 -06:00
Cyberes 32ad97e57c do default model rather than default backend, adjust moderation endpoint logic and add timeout, exclude system tokens from recent proompters, calculate number of moderators from endpoint concurrent gens, adjust homepage 2023-10-03 13:40:08 -06:00
Cyberes 63c12ea830 fix 2023-10-03 01:25:43 -06:00
Cyberes ca1baa4870 test 2023-10-03 00:15:16 -06:00
Cyberes 62eb0196cc t 2023-10-03 00:13:55 -06:00
Cyberes 0f5e22191c test 2023-10-03 00:12:37 -06:00
Cyberes 70126acdf2 test 2023-10-03 00:12:13 -06:00
Cyberes f6acd67738 t 2023-10-03 00:05:32 -06:00
Cyberes 07d6f6d8e9 test 2023-10-03 00:03:39 -06:00
Cyberes cd325216e2 test 2023-10-02 22:45:07 -06:00
Cyberes aed5db4968 trying to narrow down error 2023-10-02 21:43:36 -06:00
Cyberes 94141b8ecf fix processing not being decremented on streaming, fix confusion over queue, adjust stop sequences 2023-10-02 20:53:08 -06:00
Cyberes 4f226ae38e handle requests to offline backends 2023-10-02 11:11:48 -06:00
Cyberes b0089859d7 fix ratelimiting 2023-10-02 02:05:15 -06:00
Cyberes d1c4e68f8b fix openai models response 2023-10-01 23:07:49 -06:00
Cyberes 21da2f6373 fix openai error message 2023-10-01 22:58:08 -06:00
Cyberes a594729d00 fix keyerror 2023-10-01 22:37:13 -06:00
Cyberes 51881ae39d fix tokenizer 2023-10-01 17:19:34 -06:00
Cyberes f7e9687527 finish openai endpoints 2023-10-01 16:04:53 -06:00
Cyberes 2a3ff7e21e update openai endpoints 2023-10-01 14:15:01 -06:00
Cyberes 93d19fb95b fix exception 2023-10-01 10:25:32 -06:00
Cyberes d203973e80 fix routes 2023-10-01 01:13:13 -06:00
Cyberes 25ec56a5ef get streaming working, remove /v2/ 2023-10-01 00:20:00 -06:00
Cyberes b10d22ca0d cache the home page in the background 2023-09-30 23:03:42 -06:00
Cyberes 3ecb7bcf88 adjust jinja template 2023-09-30 22:14:50 -06:00
Cyberes c5b30d985c adjust jinja template 2023-09-30 22:11:51 -06:00
Cyberes bc25d92c95 reduce tokens for backend tester 2023-09-30 21:48:16 -06:00
Cyberes 9235725bdd adjust message 2023-09-30 21:35:55 -06:00
Cyberes 61856b4383 adjust message 2023-09-30 21:34:32 -06:00
Cyberes 7af3dbd76b add message about settings 2023-09-30 21:31:25 -06:00
Cyberes 592eb08cb1 add message for /v1/ 2023-09-30 21:07:12 -06:00
Cyberes 166b2316e8 depricate v1 2023-09-30 20:59:24 -06:00
Cyberes 1151bb5475 adjust stats 2023-09-30 20:42:48 -06:00
Cyberes 91ba2fad1b add proompter stats back in 2023-09-30 20:11:14 -06:00
Cyberes e553fa6e9f adjust home page fontsize 2023-09-30 20:01:36 -06:00
Cyberes 11a10f85c1 adjust home page 2023-09-30 19:59:30 -06:00
Cyberes e6267a7d46 remove vllm from requirements.txt 2023-09-30 19:45:04 -06:00
Cyberes e0f86d053a reorganize to api v2 2023-09-30 19:42:41 -06:00
Cyberes 114f36e709 functional 2023-09-30 19:41:50 -06:00
Cyberes 624ca74ce5 mvp 2023-09-29 00:09:44 -06:00
Cyberes e7b57cad7b set up cluster config and basic background workers 2023-09-28 18:40:24 -06:00