Cyberes
|
11e84db59c
|
update database, tokenizer handle null prompt, convert top_p to vllm on openai, actually validate prompt on streaming,
|
2023-09-25 22:32:48 -06:00 |
Cyberes
|
2d299dbae5
|
openai_force_no_hashes
|
2023-09-25 22:01:57 -06:00 |
Cyberes
|
8240a1ebbb
|
fix background log not doing anything
|
2023-09-25 18:18:29 -06:00 |
Cyberes
|
289b40181c
|
forgot to test all config possibilities
|
2023-09-25 17:23:43 -06:00 |
Cyberes
|
30282479a0
|
fix flask exception
|
2023-09-25 17:22:28 -06:00 |
Cyberes
|
135bd743bb
|
fix homepage slowness, fix incorrect 24 hr prompters, fix redis wrapper,
|
2023-09-25 17:20:21 -06:00 |
Cyberes
|
bbe5d5a8fe
|
improve openai endpoint, exclude system tokens more places
|
2023-09-25 09:32:23 -06:00 |
Cyberes
|
6459a1c91b
|
allow setting simultaneous IP limit per-token, fix token use tracker, fix tokens on streaming
|
2023-09-25 00:55:20 -06:00 |
Cyberes
|
320f51e01c
|
further align openai endpoint with expected responses
|
2023-09-24 21:45:30 -06:00 |
Cyberes
|
8d6b2ce49c
|
minor changes, add admin token auth system, add route to get backend info
|
2023-09-24 15:54:35 -06:00 |
Cyberes
|
cb99c3490e
|
rewrite tokenizer, restructure validation
|
2023-09-24 13:02:30 -06:00 |
Cyberes
|
62412f4873
|
add config setting for hostname
|
2023-09-23 23:24:08 -06:00 |
Cyberes
|
84a1fcfdd8
|
don't store host if it's an IP
|
2023-09-23 23:14:22 -06:00 |
Cyberes
|
fab7b7ccdd
|
active gen workers wait
|
2023-09-23 21:17:13 -06:00 |
Cyberes
|
03e3ec5490
|
port to mysql, use vllm tokenizer endpoint
|
2023-09-20 20:30:31 -06:00 |
Cyberes
|
3c1254d3bf
|
cache stats in background
|
2023-09-17 18:55:36 -06:00 |
Cyberes
|
3100b0a924
|
set up queue to work with gunicorn processes, other improvements
|
2023-09-14 17:38:20 -06:00 |
Cyberes
|
a89295193f
|
add moderation endpoint to openai api, update config
|
2023-09-14 15:07:17 -06:00 |
Cyberes
|
507327db49
|
lower caching of home page
|
2023-09-14 14:30:01 -06:00 |
Cyberes
|
79b1e01b61
|
option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup
|
2023-09-14 14:05:50 -06:00 |
Cyberes
|
035c17c48b
|
reformat info page info_html field
|
2023-09-13 20:40:55 -06:00 |
Cyberes
|
12e894032e
|
show the openai system prompt
|
2023-09-13 20:25:56 -06:00 |
Cyberes
|
9740df07c7
|
add openai-compatible backend
|
2023-09-12 16:40:09 -06:00 |
Cyberes
|
1d9f40765e
|
remove text-generation-inference backend
|
2023-09-12 13:09:47 -06:00 |
Cyberes
|
6152b1bb66
|
fix invalid param error, add manual model name
|
2023-09-12 10:30:45 -06:00 |
Cyberes
|
57ccedcfb9
|
adjust some things
|
2023-09-12 01:10:58 -06:00 |
Cyberes
|
a84386c311
|
move import check furthger up
|
2023-09-12 01:05:03 -06:00 |
Cyberes
|
40ac84aa9a
|
actually we don't want to emulate openai
|
2023-09-12 01:04:11 -06:00 |
Cyberes
|
4c9d543eab
|
implement vllm backend
|
2023-09-11 20:47:19 -06:00 |
Cyberes
|
c14cc51f09
|
get working with ooba again, give up on dockerfile
|
2023-09-11 09:51:01 -06:00 |
Cyberes
|
2816c01902
|
refactor generation route
|
2023-08-30 18:53:26 -06:00 |
Cyberes
|
bf648f605f
|
implement streaming for hf-textgen
|
2023-08-29 17:56:12 -06:00 |
Cyberes
|
b44dfa2471
|
update info page
|
2023-08-29 14:00:35 -06:00 |
Cyberes
|
ba0bc87434
|
add HF text-generation-inference backend
|
2023-08-29 13:46:41 -06:00 |
Cyberes
|
c16d70a24d
|
limit amount of simultaneous requests an IP can make
|
2023-08-27 23:48:10 -06:00 |
Cyberes
|
1a4cb5f786
|
reorganize stats page again
|
2023-08-27 22:24:44 -06:00 |
Cyberes
|
0150bbf8dd
|
actually we do know how long it will take
|
2023-08-25 15:17:01 -06:00 |
Cyberes
|
c6edeb2b70
|
There will be a wait if the queue is empty but prompts are processing
|
2023-08-25 13:53:23 -06:00 |
Cyberes
|
0230ddda17
|
dynamically fetch GPUs for netdata
|
2023-08-24 21:56:15 -06:00 |
Cyberes
|
16b986c206
|
track nvidia power states through netdata
|
2023-08-24 21:36:00 -06:00 |
Cyberes
|
ec3fe2c2ac
|
show total output tokens on stats
|
2023-08-24 20:43:11 -06:00 |
Cyberes
|
c397649097
|
restyle homepage, add config item to add content to the home page
|
2023-08-24 17:55:55 -06:00 |
Cyberes
|
d1b227fd43
|
adjust caching on home page
|
2023-08-24 16:48:36 -06:00 |
Cyberes
|
763dd832cc
|
update home, update readme, calculate estimated wait based on database stats
|
2023-08-24 16:47:14 -06:00 |
Cyberes
|
f7743ade89
|
add dynamic analitics tracking to home page
|
2023-08-23 23:27:33 -06:00 |
Cyberes
|
48f4e0c7dd
|
adjust caching
|
2023-08-23 23:14:18 -06:00 |
Cyberes
|
f3fe514c11
|
add home template
|
2023-08-23 23:11:12 -06:00 |
Cyberes
|
4ffee69b1d
|
oops
|
2023-08-23 22:21:59 -06:00 |
Cyberes
|
3317bd5f1a
|
allow hiding of more variables
|
2023-08-23 22:08:10 -06:00 |
Cyberes
|
11a0b6541f
|
fix some stuff related to gunicorn workers
|
2023-08-23 22:01:06 -06:00 |
Cyberes
|
de19af900f
|
add estimated wait time and other time tracking stats
|
2023-08-23 21:33:52 -06:00 |
Cyberes
|
0aa52863bc
|
forgot to start workers
|
2023-08-23 20:33:49 -06:00 |
Cyberes
|
ff8bb0f7c6
|
shut up
|
2023-08-23 16:14:13 -06:00 |
Cyberes
|
9fc674878d
|
allow disabling ssl verification
|
2023-08-23 16:11:32 -06:00 |
Cyberes
|
9f14b166dd
|
fix proompters_1_min, other minor changes
|
2023-08-22 22:32:29 -06:00 |
Cyberes
|
a525093c75
|
rename, more stats
|
2023-08-22 20:42:38 -06:00 |
Cyberes
|
0d32db2dbd
|
prototype hf-textgen and adjust logging
|
2023-08-22 19:58:31 -06:00 |
Cyberes
|
a59dcea2da
|
more proxy stats
|
2023-08-22 16:50:49 -06:00 |
Cyberes
|
bd85b2a4f9
|
fix freeze on startup
|
2023-08-22 08:36:59 -06:00 |
Cyberes
|
ad9a91f1b5
|
concurrent gens setting, online status
|
2023-08-22 00:26:46 -06:00 |
Cyberes
|
6e3ddab42e
|
fix relative paths for db path
|
2023-08-21 23:07:12 -06:00 |
Cyberes
|
e04d6a8a13
|
minor adjustments
|
2023-08-21 22:49:44 -06:00 |
Cyberes
|
8cbf643fd3
|
MVP
|
2023-08-21 21:28:52 -06:00 |