basic_tutorials | Update Quantization docs and minor doc fix. (#2368) | 2024-08-08 16:06:57 -04:00 |
conceptual | Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385) | 2024-08-09 16:41:17 +02:00 |
_toctree.yml | add usage stats to toctree (#2260) | 2024-07-19 16:34:04 +02:00 |
architecture.md | add doc for intel gpus (#2181) | 2024-07-08 15:57:06 +02:00 |
index.md | fix typos in docs and add small clarifications (#1790) | 2024-04-22 12:15:48 -04:00 |
installation.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00 |
installation_amd.md | Preparing for release. (#2285) | 2024-07-23 16:20:17 +02:00 |
installation_gaudi.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00 |
installation_inferentia.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00 |
installation_intel.md | Preparing for release. (#2285) | 2024-07-23 16:20:17 +02:00 |
installation_nvidia.md | Preparing for release. (#2285) | 2024-07-23 16:20:17 +02:00 |
messages_api.md | chore: add pre-commit (#1569) | 2024-02-16 11:58:58 +01:00 |
quicktour.md | Update documentation for Supported models (#2386) | 2024-08-09 15:01:34 +02:00 |
supported_models.md | Update documentation for Supported models (#2386) | 2024-08-09 15:01:34 +02:00 |
usage_statistics.md | refactor usage stats (#2339) | 2024-07-31 16:29:07 +02:00 |