hf_text-generation-inference/backends/v3/src
Nicolas Patry 7a48a84784
Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385)
* Using an enum for flash backens (paged/flashdecoding/flashinfer)

* Early exit on server too.

* Clippy.

* Fix clippy and fmt.
2024-08-09 16:41:17 +02:00
..
client Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
backend.rs Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385) 2024-08-09 16:41:17 +02:00
block_allocator.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
lib.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
main.rs Pr 2352 ci branch (#2382) 2024-08-09 10:54:32 +02:00
queue.rs Pr 2352 ci branch (#2382) 2024-08-09 10:54:32 +02:00