Morgan Funtowicz
|
2b65669581
|
misc(backend): make sure to correctly set IS_GHA_BUILD=true in wf
|
2024-12-17 10:53:42 +01:00 |
Morgan Funtowicz
|
e5cc47a42e
|
misc(backend): missing env directive
|
2024-12-17 10:41:36 +01:00 |
Morgan Funtowicz
|
b303013227
|
misc(backend): let's try with GHA
|
2024-12-17 10:40:51 +01:00 |
Morgan Funtowicz
|
7dcee83a63
|
misc(backend): once more?
|
2024-12-16 16:03:07 +01:00 |
Morgan Funtowicz
|
5ded9cbd22
|
misc(backend): test with TGI S3 conf
|
2024-12-16 15:51:03 +01:00 |
Morgan Funtowicz
|
79f1b953dc
|
misc(backend): test with TGI S3 conf
|
2024-12-16 15:47:51 +01:00 |
Morgan Funtowicz
|
104885c71c
|
misc(backend): indent
|
2024-12-13 15:37:19 +01:00 |
Morgan Funtowicz
|
8d9580669d
|
misc(ci): WAT
|
2024-12-13 09:46:13 +01:00 |
Morgan Funtowicz
|
d5052a9054
|
misc(ci): WAT
|
2024-12-13 09:43:22 +01:00 |
Morgan Funtowicz
|
83e919f617
|
misc(ci): WAT
|
2024-12-12 16:42:38 +01:00 |
Morgan Funtowicz
|
e312a68469
|
misc(ci): WAT
|
2024-12-12 16:14:48 +01:00 |
Morgan Funtowicz
|
4aa060f99a
|
misc(ci): WAT
|
2024-12-12 16:08:01 +01:00 |
Morgan Funtowicz
|
6c62ded864
|
misc(ci): do not build with ssl enabled
|
2024-12-12 15:57:11 +01:00 |
Morgan Funtowicz
|
b31477cf63
|
misc(ci): lets actually use sccache ...
|
2024-12-12 15:38:44 +01:00 |
Morgan Funtowicz
|
68f5466c86
|
misc(ci): add debug profile
|
2024-12-12 14:52:12 +01:00 |
Morgan Funtowicz
|
e37d131b8e
|
misc(ci): add debug profile
|
2024-12-12 14:36:57 +01:00 |
Morgan Funtowicz
|
f99049aafe
|
misc(ci): again
|
2024-12-12 14:24:33 +01:00 |
Morgan Funtowicz
|
20f8cb9a95
|
misc(ci): again
|
2024-12-12 13:11:54 +01:00 |
Morgan Funtowicz
|
e6abfdcb1f
|
misc(ci): let's try this way
|
2024-12-12 13:00:13 +01:00 |
Morgan Funtowicz
|
5f1b16f300
|
misc(ci): export aws creds as output of step
|
2024-12-12 12:48:58 +01:00 |
Morgan Funtowicz
|
5910dabb4e
|
misc(ci): provide mechanism to cache inside container
|
2024-12-12 12:45:14 +01:00 |
Morgan Funtowicz
|
e703c84578
|
misc(ci): let's try to build the Dockerfile for trtllm
|
2024-12-12 11:50:39 +01:00 |
Morgan Funtowicz
|
48a1a602e7
|
misc(ci): update Rust action toolchain
|
2024-12-12 09:40:07 +01:00 |
Morgan Funtowicz
|
de36c8e6dd
|
misc(ci): enable building tensorrt-llm
|
2024-12-12 09:29:14 +01:00 |
Morgan Funtowicz
|
b653605e54
|
feat(trtllm): fix logits retrieval
|
2024-12-10 23:28:13 +01:00 |
Morgan Funtowicz
|
60059b6968
|
feat(trtllm): expose finish reason to Rust
|
2024-12-10 16:51:22 +01:00 |
Morgan Funtowicz
|
f0cd4742c2
|
misc(backend): fix reborrowing Pin<&mut T> as described in the doc https://doc.rust-lang.org/stable/std/pin/struct.Pin.html#method.as_mut
|
2024-12-05 16:31:19 +01:00 |
Morgan Funtowicz
|
049f4acd5b
|
feat(backend): fix missing "0" field access
|
2024-12-05 15:29:23 +01:00 |
Morgan Funtowicz
|
b3cd5ea076
|
feat(backend): make sure we can easily cancel request on the executor
|
2024-12-05 13:54:56 +01:00 |
Morgan Funtowicz
|
300f6c6f94
|
feat(backend): fix moving backend when pulling
|
2024-12-04 17:32:14 +01:00 |
Morgan Funtowicz
|
460f290d5b
|
effectively cancel the request on the executor
|
2024-12-04 14:29:04 +01:00 |
Morgan Funtowicz
|
b6dbf605af
|
chore(trtllm): update dependency towards 0.15.0
|
2024-12-04 12:02:42 +01:00 |
Morgan Funtowicz
|
cc6bc339e5
|
test(backend): more test coverage
|
2024-12-04 00:16:02 +01:00 |
Morgan Funtowicz
|
62530649b8
|
feat(backend): remove constexpig
|
2024-12-03 16:47:48 +01:00 |
Morgan Funtowicz
|
881527a544
|
feat(backend): remove constexpr from par
|
2024-12-03 16:46:59 +01:00 |
Morgan Funtowicz
|
ad3ed0d1a1
|
test(backend): add more unittest
|
2024-12-03 14:39:10 +01:00 |
Morgan Funtowicz
|
c94b9de445
|
feat(backend): add guard to multiple header definitions
|
2024-12-03 14:07:49 +01:00 |
Morgan Funtowicz
|
16ba2f5a2b
|
feat(backend): fix main.rs retrieving the tokenizer
|
2024-12-03 12:11:17 +01:00 |
Morgan Funtowicz
|
874bc28d6c
|
feat(backend): make backend_workspace_t::engines_folder constexpr
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
2f8634ec01
|
feat(backend): impl missing generation_step_t as return value of pull_tokens
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
a7bad25c41
|
feat(backend): fix backend_exception_t -> backend_error_t naming
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
879e1a4178
|
feat(backend): allow overriding which Python to use
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
71e700a6ea
|
feat(backend): use latest trtllm main version to have g++ >= 13 compatibility
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
fd7e2b5bbd
|
feat(backend): more impl
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
df99164dc1
|
feat(backend): delete previous backend impl
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
25c6bbe142
|
feat(backend): introduce backend_workspace_t to store precomputed information from the engine folder
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
702dc9cd05
|
feat(backend): missing return statement
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
87272ffe39
|
feat(backend): enable compiler warning if support for RVO not applying
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
9bb6309712
|
feat(backend): added some logging
|
2024-12-03 09:43:03 +01:00 |
Morgan Funtowicz
|
6d3565759a
|
feat(backend): remove all the logs from hardware.hpp
|
2024-12-03 09:43:03 +01:00 |