adapters | feat: add ruff and resolve issue (#2262) | 2024-07-26 10:29:09 -04:00
layers | Support qwen2 vl (#2689) | 2024-10-30 12:40:51 -04:00
models | Logprobs cost too much. | 2024-11-10 07:00:22 +01:00
pb | chore: add pre-commit (#1569) | 2024-02-16 11:58:58 +01:00
utils | Add support for FP8 KV cache scales (#2628) | 2024-10-24 16:36:18 +02:00
__init__.py | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00
cli.py | Support `e4m3fn` KV cache (#2655) | 2024-10-17 10:42:16 +02:00
interceptor.py | feat: prefill chunking (#2600) | 2024-10-16 12:49:33 +02:00