adapters
|
feat: add ruff and resolve issue (#2262)
|
2024-07-26 10:29:09 -04:00 |
layers
|
Add support for fused MoE Marlin for AWQ (#2616)
|
2024-10-08 11:56:41 +02:00 |
models
|
enable mllama in intel platform (#2610)
|
2024-10-07 21:15:09 +02:00 |
pb
|
chore: add pre-commit (#1569)
|
2024-02-16 11:58:58 +01:00 |
utils
|
fix: adjust test to only run on cuda
|
2024-10-09 20:02:29 +00:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
Add basic FP8 KV cache support (#2603)
|
2024-10-04 17:51:48 +02:00 |
interceptor.py
|
v2.0.0 (#1736)
|
2024-04-12 18:38:34 +02:00 |
server.py
|
Add basic FP8 KV cache support (#2603)
|
2024-10-04 17:51:48 +02:00 |