hf_text-generation-inference/proto
OlivierDehaene fe80f5360c
feat(server): auto max_batch_total_tokens for flash att models (#630)
2023-07-19 09:31:25 +02:00
generate.proto feat(server): auto max_batch_total_tokens for flash att models (#630) 2023-07-19 09:31:25 +02:00