Cyberes | 79b1e01b61 | option to disable streaming, improve timeout on requests to backend, fix error handling. reduce duplicate code, misc other cleanup | 2023-09-14 14:05:50 -06:00 |
Cyberes | c45e68a8c8 | adjust requests timeout, add service file | 2023-09-14 01:32:49 -06:00 |
Cyberes | 1d9f40765e | remove text-generation-inference backend | 2023-09-12 13:09:47 -06:00 |
Cyberes | 40ac84aa9a | actually we don't want to emulate openai | 2023-09-12 01:04:11 -06:00 |
Cyberes | 4c9d543eab | implement vllm backend | 2023-09-11 20:47:19 -06:00 |
Cyberes | c14cc51f09 | get working with ooba again, give up on dockerfile | 2023-09-11 09:51:01 -06:00 |
Cyberes | f9b9051bad | update weighted_average_column_for_model to account for when there was an error reported, insert null for response tokens when error, correctly parse x-forwarded-for, correctly convert model reported by hf-textgen | 2023-08-29 15:46:56 -06:00 |
Cyberes | 7eb930fafd | crap | 2023-08-23 16:12:25 -06:00 |
Cyberes | 9fc674878d | allow disabling ssl verification | 2023-08-23 16:11:32 -06:00 |
Cyberes | 508089ce11 | model info timeout and additional info | 2023-08-23 16:07:43 -06:00 |
Cyberes | 1f5e2da637 | print fetch model error message | 2023-08-23 16:02:57 -06:00 |
Cyberes | 0d32db2dbd | prototype hf-textgen and adjust logging | 2023-08-22 19:58:31 -06:00 |