Cyberes | 40ac84aa9a | actually we don't want to emulate openai | 2023-09-12 01:04:11 -06:00 |
Cyberes | bf648f605f | implement streaming for hf-textgen | 2023-08-29 17:56:12 -06:00 |
Cyberes | f9b9051bad | update weighted_average_column_for_model to account for when there was an error reported, insert null for response tokens when error, correctly parse x-forwarded-for, correctly convert model reported by hf-textgen | 2023-08-29 15:46:56 -06:00 |
Cyberes | da77a24eaa | damn | 2023-08-29 14:58:08 -06:00 |
Cyberes | 2d9ec15302 | I swear I know what I'm doing | 2023-08-29 14:57:49 -06:00 |
Cyberes | 23f3fcf579 | log errors to database | 2023-08-29 14:48:33 -06:00 |
Cyberes | ba0bc87434 | add HF text-generation-inference backend | 2023-08-29 13:46:41 -06:00 |
Cyberes | 441a870e85 | calculate weighted average for stat tracking | 2023-08-27 19:58:04 -06:00 |
Cyberes | 6a09ffc8a4 | log model used in request so we can pull the correct averages when we change models | 2023-08-26 00:30:59 -06:00 |
Cyberes | 2543db87e8 | fix database error | 2023-08-25 12:25:30 -06:00 |
Cyberes | 839bb115c6 | reorganize stats, add 24 hr proompters, adjust logging when error | 2023-08-25 12:20:16 -06:00 |
Cyberes | ec3fe2c2ac | show total output tokens on stats | 2023-08-24 20:43:11 -06:00 |
Cyberes | 763dd832cc | update home, update readme, calculate estimated wait based on database stats | 2023-08-24 16:47:14 -06:00 |
Cyberes | dca2dc570f | adjust db gen time | 2023-08-23 22:28:03 -06:00 |
Cyberes | e52acb03a4 | log gen time to DB, also keep generation_elapsed under 3 min | 2023-08-23 22:20:39 -06:00 |
Cyberes | 11a0b6541f | fix some stuff related to gunicorn workers | 2023-08-23 22:01:06 -06:00 |
Cyberes | 6f8b70df54 | add a queue system | 2023-08-23 20:12:38 -06:00 |
Cyberes | 0d32db2dbd | prototype hf-textgen and adjust logging | 2023-08-22 19:58:31 -06:00 |
Cyberes | e04d6a8a13 | minor adjustments | 2023-08-21 22:49:44 -06:00 |
Cyberes | 8cbf643fd3 | MVP | 2023-08-21 21:28:52 -06:00 |