preemo_text-generation-infe.../router/client
OlivierDehaene e74bd41e0f
feat(server): add paged attention to flash models (#516)
Closes #478
2023-06-30 19:09:59 +02:00
src         feat(server): add paged attention to flash models (#516)    2023-06-30 19:09:59 +02:00
Cargo.toml  feat(docker): add benchmarking tool to docker image (#298)  2023-05-09 13:19:31 +02:00
build.rs    feat: add distributed tracing (#62)                         2023-02-13 13:02:45 +01:00