hf_text-generation-inference/backends
Morgan Funtowicz 38b5263c61 (ffi) add max_new_tokens parameters 2024-10-21 10:00:27 +02:00
..
client feat: prefill chunking (#2600) 2024-10-16 12:49:33 +02:00
grpc-metadata Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
trtllm (ffi) add max_new_tokens parameters 2024-10-21 10:00:27 +02:00
v2 feat: prefill chunking (#2600) 2024-10-16 12:49:33 +02:00
v3 feat: prefill chunking (#2600) 2024-10-16 12:49:33 +02:00