basic_tutorials | Add support for Marlin-quantized models | 2024-06-06 13:16:52 +02:00
conceptual | Add support for exl2 quantization | 2024-05-30 11:28:05 +02:00
_toctree.yml | Adding architecture document (#2044) | 2024-06-14 09:28:34 -04:00
architecture.md | Adding architecture document (#2044) | 2024-06-14 09:28:34 -04:00
installation.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00
installation_gaudi.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00
installation_inferentia.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00
messages_api.md | chore: add pre-commit (#1569) | 2024-02-16 11:58:58 +01:00
supported_models.md | Update the link for qwen2 (#2068) | 2024-06-14 11:59:33 +02:00