hf_text-generation-inference/integration-tests
drbh 343aa7a197
fix: Handle concurrent grammar requests (#1610)
This PR fixes parallel grammar requests. Currently, grammar states are
not concatenated correctly when a new request is added to the batch,
which results in incorrect generation. This PR updates the `concatenate`
function to correctly include the previous states.
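
As a rough illustration of the idea (not the actual code from this PR), the sketch below shows what "including the previous states" can mean when batches are merged. The `GrammarBatch` class, its field names, and the integer state representation are all hypothetical and chosen only for this example; the real `concatenate` in the repository operates on its own batch types.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GrammarBatch:
    """Hypothetical stand-in for per-batch grammar bookkeeping."""
    grammars: List[str]    # one grammar per request in the batch
    fsm_states: List[int]  # current grammar state per request


def concatenate(batches: List[GrammarBatch]) -> GrammarBatch:
    """Merge batches, carrying each request's existing state forward.

    Requests already in flight keep their current state; newly added
    requests contribute whatever initial state they arrive with.
    """
    grammars: List[str] = []
    fsm_states: List[int] = []
    for batch in batches:
        grammars.extend(batch.grammars)
        fsm_states.extend(batch.fsm_states)
    return GrammarBatch(grammars=grammars, fsm_states=fsm_states)


# A running request that has advanced to state 7, plus a new request at state 0.
running = GrammarBatch(grammars=["json"], fsm_states=[7])
incoming = GrammarBatch(grammars=["regex"], fsm_states=[0])
merged = concatenate([running, incoming])
```

In this sketch the merged batch keeps the running request's state alongside the new request's initial state, so neither request's grammar progress is lost when the batch grows.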

fixes: #1601
2024-02-29 11:17:42 +01:00
models fix: Handle concurrent grammar requests (#1610) 2024-02-29 11:17:42 +01:00
conftest.py Support tools (#1587) 2024-02-28 11:10:27 +01:00
poetry.lock New release. (#941) 2023-08-29 14:28:22 +02:00
pyproject.toml v1.4.3 (#1609) 2024-02-28 16:12:14 +01:00
pytest.ini chore: add pre-commit (#1569) 2024-02-16 11:58:58 +01:00
requirements.txt New release. (#941) 2023-08-29 14:28:22 +02:00