hf_text-generation-inference/backends/llamacpp
Morgan Funtowicz b9c04b9c07 misc(doc): c++ documentation 2024-11-22 15:13:54 +01:00
..
cmake feat(backend): bind thread and memory affinity for thread 2024-11-21 13:52:38 +01:00
csrc misc(doc): c++ documentation 2024-11-22 15:13:54 +01:00
offline feat(backend): simplify overall cpp structure 2024-11-14 08:42:01 +01:00
src misc(backend): allow rebinding numa core affinity 2024-11-22 14:02:58 +01:00
CMakeLists.txt feat(backend): multistream inference on CPU 2024-11-21 00:03:05 +01:00
Cargo.toml feat(backend): rely on multi consumer queue to scheduler workers 2024-11-22 13:32:56 +01:00
build.rs feat(backend): bind thread and memory affinity for thread 2024-11-21 13:52:38 +01:00