Commit Graph

1217 Commits

Author SHA1 Message Date
Morgan Funtowicz 2b65669581 misc(backend): make sure to correctly set IS_GHA_BUILD=true in wf 2024-12-17 10:53:42 +01:00
Morgan Funtowicz e5cc47a42e misc(backend): missing env directive 2024-12-17 10:41:36 +01:00
Morgan Funtowicz b303013227 misc(backend): let's try with GHA 2024-12-17 10:40:51 +01:00
Morgan Funtowicz 7dcee83a63 misc(backend): once more? 2024-12-16 16:03:07 +01:00
Morgan Funtowicz 5ded9cbd22 misc(backend): test with TGI S3 conf 2024-12-16 15:51:03 +01:00
Morgan Funtowicz 79f1b953dc misc(backend): test with TGI S3 conf 2024-12-16 15:47:51 +01:00
Morgan Funtowicz 104885c71c misc(backend): indent 2024-12-13 15:37:19 +01:00
Morgan Funtowicz 8d9580669d misc(ci): WAT 2024-12-13 09:46:13 +01:00
Morgan Funtowicz d5052a9054 misc(ci): WAT 2024-12-13 09:43:22 +01:00
Morgan Funtowicz 83e919f617 misc(ci): WAT 2024-12-12 16:42:38 +01:00
Morgan Funtowicz e312a68469 misc(ci): WAT 2024-12-12 16:14:48 +01:00
Morgan Funtowicz 4aa060f99a misc(ci): WAT 2024-12-12 16:08:01 +01:00
Morgan Funtowicz 6c62ded864 misc(ci): do not build with ssl enabled 2024-12-12 15:57:11 +01:00
Morgan Funtowicz b31477cf63 misc(ci): lets actually use sccache ... 2024-12-12 15:38:44 +01:00
Morgan Funtowicz 68f5466c86 misc(ci): add debug profile 2024-12-12 14:52:12 +01:00
Morgan Funtowicz e37d131b8e misc(ci): add debug profile 2024-12-12 14:36:57 +01:00
Morgan Funtowicz f99049aafe misc(ci): again 2024-12-12 14:24:33 +01:00
Morgan Funtowicz 20f8cb9a95 misc(ci): again 2024-12-12 13:11:54 +01:00
Morgan Funtowicz e6abfdcb1f misc(ci): let's try this way 2024-12-12 13:00:13 +01:00
Morgan Funtowicz 5f1b16f300 misc(ci): export aws creds as output of step 2024-12-12 12:48:58 +01:00
Morgan Funtowicz 5910dabb4e misc(ci): provide mecanism to cache inside container 2024-12-12 12:45:14 +01:00
Morgan Funtowicz e703c84578 misc(ci): let's try to build the Dockerfile for trtllm 2024-12-12 11:50:39 +01:00
Morgan Funtowicz 48a1a602e7 misc(ci): update Rust action toolchain 2024-12-12 09:40:07 +01:00
Morgan Funtowicz de36c8e6dd misc(ci): enabe building tensorrt-llm 2024-12-12 09:29:14 +01:00
Morgan Funtowicz b653605e54 feat(trtllm): fix logits retrieval 2024-12-10 23:28:13 +01:00
Morgan Funtowicz 60059b6968 feat(trtllm): expose finish reason to Rust 2024-12-10 16:51:22 +01:00
Morgan Funtowicz f0cd4742c2 misc(backend): fix reborrowing Pin<&mut T> as described in the doc https://doc.rust-lang.org/stable/std/pin/struct.Pin.html#method.as_mut 2024-12-05 16:31:19 +01:00
Morgan Funtowicz 049f4acd5b feat(backend): fix missing "0" field access 2024-12-05 15:29:23 +01:00
Morgan Funtowicz b3cd5ea076 feat(backend): make sure we can easily cancel request on the executor 2024-12-05 13:54:56 +01:00
Morgan Funtowicz 300f6c6f94 feat(backend) fix moving backend when pulling 2024-12-04 17:32:14 +01:00
Morgan Funtowicz 460f290d5b effectively cancel the request on the executor 2024-12-04 14:29:04 +01:00
Morgan Funtowicz b6dbf605af chore(trtllm): update dependency towards 0.15.0 2024-12-04 12:02:42 +01:00
Morgan Funtowicz cc6bc339e5 test(backend): more test coverage 2024-12-04 00:16:02 +01:00
Morgan Funtowicz 62530649b8 feat(backend): remove constexpig 2024-12-03 16:47:48 +01:00
Morgan Funtowicz 881527a544 feat(backend): remove constexpr from par 2024-12-03 16:46:59 +01:00
Morgan Funtowicz ad3ed0d1a1 test(backend): add more unittest 2024-12-03 14:39:10 +01:00
Morgan Funtowicz c94b9de445 feat(backend): add guard to multiple header definitions 2024-12-03 14:07:49 +01:00
Morgan Funtowicz 16ba2f5a2b feat(backend): fix main.rs retrieving the tokenizer 2024-12-03 12:11:17 +01:00
Morgan Funtowicz 874bc28d6c feat(backend): make backend_workspace_t::engines_folder constexpr 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 2f8634ec01 feat(backend): impl missing generation_step_t as return value of pull_tokens 2024-12-03 09:43:03 +01:00
Morgan Funtowicz a7bad25c41 feat(backend): fix backend_exception_t -> backend_error_t naming 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 879e1a4178 feat(backend): allow overriding which Python to use 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 71e700a6ea feat(backend): use latest trtllm main version to have g++ >= 13 compatibility 2024-12-03 09:43:03 +01:00
Morgan Funtowicz fd7e2b5bbd feat(backend): more impl 2024-12-03 09:43:03 +01:00
Morgan Funtowicz df99164dc1 feat(backend): delete previous backend impl 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 25c6bbe142 feat(backend): introduce backend_workspace_t to store precomputed information from the engine folder 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 702dc9cd05 feat(backend): missing return statement 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 87272ffe39 feat(backend): enable compiler warning if support for RVO not applying 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 9bb6309712 feat(backend): added some logging 2024-12-03 09:43:03 +01:00
Morgan Funtowicz 6d3565759a feat(backend): remove all the logs from hardware.hpp 2024-12-03 09:43:03 +01:00