Commit Graph

238 Commits

Author SHA1 Message Date
Cyberes b44dfa2471 update info page 2023-08-29 14:00:35 -06:00
Cyberes ba0bc87434 add HF text-generation-inference backend 2023-08-29 13:46:41 -06:00
Cyberes 6c0e60135d exclude tokens with priority 0 from simultaneous requests ratelimit 2023-08-28 00:03:25 -06:00
Cyberes c16d70a24d limit amount of simultaneous requests an IP can make 2023-08-27 23:48:10 -06:00
Cyberes 1a4cb5f786 reorganize stats page again 2023-08-27 22:24:44 -06:00
Cyberes f43336c92c adjust estimated wait time calculations 2023-08-27 22:17:21 -06:00
Cyberes 441a870e85 calculate weighted average for stat tracking 2023-08-27 19:58:04 -06:00
Cyberes 6a09ffc8a4 log model used in request so we can pull the correct averages when we change models 2023-08-26 00:30:59 -06:00
Cyberes 0150bbf8dd actually we do know how long it will take 2023-08-25 15:17:01 -06:00
Cyberes d64152587c reorganize nvidia stats 2023-08-25 15:02:40 -06:00
Cyberes c6edeb2b70 There will be a wait if the queue is empty but prompts are processing 2023-08-25 13:53:23 -06:00
Cyberes 0e6aadf5e1 fix missing empty strings logged when errors 2023-08-25 13:44:41 -06:00
Cyberes 2543db87e8 fix database error 2023-08-25 12:25:30 -06:00
Cyberes 839bb115c6 reorganize stats, add 24 hr proompters, adjust logging when error 2023-08-25 12:20:16 -06:00
Cyberes 26a0a13aa7 actually we want this 2023-08-24 23:57:46 -06:00
Cyberes 0b4da89de2 fix exception 2023-08-24 23:57:25 -06:00
Cyberes 25e3255c9b fix issue with tokenizer 2023-08-24 23:13:07 -06:00
Cyberes 77fe1e237e also handle when no response 2023-08-24 22:53:54 -06:00
Cyberes e5aca7b09d adjust netdata json, don't log error messages during generationg 2023-08-24 22:53:06 -06:00
Cyberes 06173f900e remove debug print 2023-08-24 22:02:15 -06:00
Cyberes 0230ddda17 dynamically fetch GPUs for netdata 2023-08-24 21:56:15 -06:00
Cyberes 16b986c206 track nvidia power states through netdata 2023-08-24 21:36:00 -06:00
Cyberes 01b8442b95 update current model when we generate_stats() 2023-08-24 21:10:00 -06:00
Cyberes ec3fe2c2ac show total output tokens on stats 2023-08-24 20:43:11 -06:00
Cyberes 9b7bf490a1 sort keys of stats dict 2023-08-24 18:59:52 -06:00
Cyberes 519ce7814c change background color 2023-08-24 18:00:04 -06:00
Cyberes 9b526e6150 actually we wanted that style 2023-08-24 17:58:01 -06:00
Cyberes d55f1b6b24 un-style highlight.js 2023-08-24 17:57:15 -06:00
Cyberes c397649097 restyle homepage, add config item to add content to the home page 2023-08-24 17:55:55 -06:00
Cyberes d1b227fd43 adjust caching on home page 2023-08-24 16:48:36 -06:00
Cyberes 763dd832cc update home, update readme, calculate estimated wait based on database stats 2023-08-24 16:47:14 -06:00
Cyberes 21174750ea update readme 2023-08-24 12:19:59 -06:00
Cyberes 1cb6389a8a update gitignore 2023-08-24 00:10:22 -06:00
Cyberes afc138c743 update readme 2023-08-24 00:09:57 -06:00
Cyberes bf1842f434 update readme 2023-08-23 23:48:46 -06:00
Cyberes f7743ade89 add dynamic analitics tracking to home page 2023-08-23 23:27:33 -06:00
Cyberes c2b0fa1cf5 remove debug code 2023-08-23 23:14:50 -06:00
Cyberes 48f4e0c7dd adjust caching 2023-08-23 23:14:18 -06:00
Cyberes f3fe514c11 add home template 2023-08-23 23:11:12 -06:00
Cyberes dca2dc570f adjust db gen time 2023-08-23 22:28:03 -06:00
Cyberes cdda2c840c dont test code, don't care 2023-08-23 22:24:32 -06:00
Cyberes 1eb8e885d0 am dumb 2023-08-23 22:22:38 -06:00
Cyberes 4ffee69b1d oops 2023-08-23 22:21:59 -06:00
Cyberes e52acb03a4 log gen time to DB, also keep generation_elapsed under 3 min 2023-08-23 22:20:39 -06:00
Cyberes 3317bd5f1a allow hiding of more variables 2023-08-23 22:08:10 -06:00
Cyberes 11a0b6541f fix some stuff related to gunicorn workers 2023-08-23 22:01:06 -06:00
Cyberes 02c07bbd53 pycarm deeleted import 2023-08-23 21:34:27 -06:00
Cyberes de19af900f add estimated wait time and other time tracking stats 2023-08-23 21:33:52 -06:00
Cyberes 0aa52863bc forgot to start workers 2023-08-23 20:33:49 -06:00
Cyberes 6f8b70df54 add a queue system 2023-08-23 20:12:38 -06:00