hf_text-generation-inference/backends/trtllm
Morgan Funtowicz 7b9f92a0aa use spdlog release 1.14.1 moving forward 2024-07-10 13:47:31 +00:00
..
cmake use spdlog release 1.14.1 moving forward 2024-07-10 13:47:31 +00:00
include implement the Stream method to send new tokens through a callback 2024-07-09 13:46:48 +00:00
lib implement the Stream method to send new tokens through a callback 2024-07-09 13:46:48 +00:00
scripts Overall build TRTLLM and deps through CMake build system 2024-07-02 17:16:27 +02:00
src unconditionally call InitializeBackend on the FFI layer 2024-07-08 22:09:09 +00:00
tests First version loading engines and making it ready for inference 2024-07-03 21:12:24 +00:00
CMakeLists.txt bind to CUDA::nvml to retrieve compute capabilities at runtime 2024-07-08 22:32:41 +00:00
Cargo.toml Enable end to end CMake build 2024-07-03 10:27:53 +02:00
build.rs Enable end to end CMake build 2024-07-03 10:27:53 +02:00