models
|
GPTNeoX: Use static rotary embedding (#1498)
|
2024-02-01 09:34:11 +01:00 |
pb
|
feat(server): clear cache on error (#143)
|
2023-03-28 11:29:35 +02:00 |
utils
|
Fixing top_n_tokens. (#1497)
|
2024-01-26 20:13:47 +01:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
v1.4.0 (#1494)
|
2024-01-26 19:04:57 +01:00 |
interceptor.py
|
feat(server): empty cache on errors
|
2023-07-12 17:06:19 +02:00 |
server.py
|
fix: fix gpt-q with groupsize = -1 (#1358)
|
2023-12-18 16:07:05 +01:00 |
tracing.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |