layers | Hotfix GPTQ. | 2024-06-03 09:32:12 +00:00 |
models | allow to fix paged attention num blocks | 2024-06-05 10:05:04 +00:00 |
pb | chore: add pre-commit (#1569) | 2024-02-16 11:58:58 +01:00 |
utils | Hotfix GPTQ. | 2024-06-03 09:32:12 +00:00 |
__init__.py | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00 |
cli.py | Add support for exl2 quantization | 2024-05-30 11:28:05 +02:00 |
interceptor.py | v2.0.0 (#1736) | 2024-04-12 18:38:34 +02:00 |
server.py | Add support for exl2 quantization | 2024-05-30 11:28:05 +02:00 |
tracing.py | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00 |