layers
|
Add support for GPTQ Marlin (#2052)
|
2024-06-14 09:45:42 +02:00 |
models
|
Support exl2-quantized Qwen2 models (#2085)
|
2024-06-20 07:56:16 +02:00 |
pb
|
chore: add pre-commit (#1569)
|
2024-02-16 11:58:58 +01:00 |
utils
|
Add support for GPTQ Marlin (#2052)
|
2024-06-14 09:45:42 +02:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
ROCm and sliding windows fixes (#2033)
|
2024-06-10 15:09:50 +08:00 |
interceptor.py
|
v2.0.0 (#1736)
|
2024-04-12 18:38:34 +02:00 |
tracing.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |