Commit Graph

73 Commits

Author SHA1 Message Date
Guillaume LEGENDRE 6de10b659d
new tailscale action 2024-03-18 10:42:14 +01:00
Guillaume LEGENDRE c64866e05a
exclude ubuntu.com domain 2024-02-21 19:45:45 +01:00
Guillaume LEGENDRE e61f124f63
fix 2024-02-21 19:33:37 +01:00
Guillaume LEGENDRE 710b760602
fix typo 2024-02-21 19:28:48 +01:00
Guillaume LEGENDRE 3a85f1bd54
try fixing buildx proxy 2024-02-21 19:27:28 +01:00
Guillaume LEGENDRE d0d0fd24a8
update tailscale action version 2024-02-21 15:43:58 +01:00
Guillaume LEGENDRE 92ab9d2ee6
change runner and remove tailscale userspace for amd 2024-02-21 15:41:05 +01:00
Guillaume LEGENDRE 383478758b
fix tailscale 2024-02-21 15:36:48 +01:00
Nicolas Patry ab60d15962 Desperate attempt. 2024-02-14 10:27:21 +00:00
Nicolas Patry 584c5fa0a0 Tailscale. 2024-02-14 10:22:36 +00:00
Nicolas Patry 212e1cbcbe no sudo. 2024-02-14 10:19:46 +00:00
Nicolas Patry ffa1804a34 .. 2024-02-14 10:16:24 +00:00
Nicolas Patry 5b0befee43 Test. 2024-02-14 10:13:45 +00:00
Nicolas Patry df91f105e8 Ofc. 2024-02-14 10:11:22 +00:00
Nicolas Patry 7f0a816a22 Maybe XML wasn't so bad after all. 2024-02-14 10:10:28 +00:00
Nicolas Patry b1aff577a0 Worse invention ever. 2024-02-14 10:09:00 +00:00
Nicolas Patry 0523031ffb ... 2024-02-14 10:05:29 +00:00
Nicolas Patry 69d1d3cde6 Bahs in yaml is not our friend. 2024-02-14 10:02:53 +00:00
Nicolas Patry e36887cbf5 Install docker manually. 2024-02-14 10:00:33 +00:00
Nicolas Patry 05aef4dd1a Upgrade install buildx. 2024-02-14 09:57:15 +00:00
Nicolas Patry 85bf172653 Our runner docker in docker. 2024-02-14 09:52:34 +00:00
Nicolas Patry c54b5c7f04 Remove tailscale. 2024-02-13 17:51:12 +01:00
Nicolas Patry a83772c87b Self hosted for nvidia too. 2024-02-13 17:31:39 +01:00
Nicolas Patry 31d965bf17 Our runner. 2024-02-13 17:15:45 +01:00
drbh c5ef81bed5
chore: bump ci rust version (#1543)
This PR bumps the rust toolchain in CI to resolve the CI build issue

```bash
  Downloaded crossbeam-utils v0.8.19
  Downloaded crc32fast v1.3.2
error: failed to compile `text-generation-router v1.4.0 (/home/runner/work/text-generation-inference/text-generation-inference/router)`, intermediate artifacts can be found at `/home/runner/work/text-generation-inference/text-generation-inference/target`

Caused by:
  package `clap_lex v0.7.0` cannot be built because it requires rustc 1.74 or newer, while the currently active rustc version is 1.71.0
  Either upgrade to rustc 1.74 or newer, or use
  cargo update -p clap_lex@0.7.0 --precise ver
  where `ver` is the latest version of `clap_lex` supporting rustc 1.71.0
make: *** [Makefile:12: install-router] Error 101
```
2024-02-09 10:32:04 +01:00
OlivierDehaene c2d4a3b5c7
v1.4.0 (#1494) 2024-01-26 19:04:57 +01:00
OlivierDehaene 9b56d3fbf5
feat: relax mistral requirements (#1351)
Close #1253 
Close #1279
2023-12-15 12:52:24 +01:00
Nicolas Patry 3238c49121
Add a stale bot. (#1313) 2023-12-05 14:42:55 +01:00
fxmarty b2b5df0e94
Add RoCm support (#1243)
This PR adds support for AMD Instinct MI210 & MI250 GPUs, with paged
attention and FAv2 support.

Remaining items to discuss, on top of possible others:
* Should we have a
`ghcr.io/huggingface/text-generation-inference:1.1.0+rocm` hosted image,
or is it too early?
* Should we set up a CI on MI210/MI250? I don't have access to the
runners of TGI though.
* Are we comfortable with those changes being directly in TGI, or do we
need a fork?

---------

Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
Co-authored-by: Your Name <you@example.com>
2023-11-27 14:08:12 +01:00
OlivierDehaene 8acdc1fae7 hotfix 1.1.1 2023-11-16 18:35:09 +01:00
Remy 72b8f88be8
fix: remove useless token (#1179)
This token is not used by your action.
Secret is removed from the repository.
2023-10-19 14:04:44 +02:00
Merve Noyan 259a230028
Automatic docs for TGI (#1045)
I had to open this PR since I initially worked from my fork, and it
requires a handful of work to trigger a new github action on my fork's
specific branch (couldn't find a way, at least, despite trying all of
them).

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2023-09-27 16:01:38 +02:00
Mishig 5df4c7c0d7
[docs] Build docs only when doc files change (#812)
Build docs only when change happens in `docs/source`

See for example
https://github.com/huggingface/api-inference/blob/main/.github/workflows/build_documentation.yml#L3-L8
2023-08-11 07:07:53 +02:00
Merve Noyan 647ae7a7d3
Setup for doc-builder and docs for TGI (#740)
I added ToC for docs v1 & started setting up for doc-builder. cc @Narsil
@osanseviero

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: osanseviero <osanseviero@gmail.com>
Co-authored-by: Mishig <mishig.davaadorj@coloradocollege.edu>
2023-08-10 10:24:52 +02:00
Nicolas Patry 92bb56b0c1
Local gptq support. (#738)
# What does this PR do?

Redoes #719

<!--
Congratulations! You've made it this far! You're not quite done yet
though.

Once merged, your PR is going to appear in the release notes with the
title you set, so make sure it's a great title that fully reflects the
extent of your awesome contribution.

Then, please replace this with a description of the change and which
issue is fixed (if applicable). Please also include relevant motivation
and context. List any dependencies (if any) that are required for this
change.

Once you're done, someone will review your PR shortly (see the section
"Who can review?" below to tag some potential reviewers). They may
suggest changes to make the code even better. If no one reviewed your PR
after a week has passed, don't hesitate to post a new comment
@-mentioning the same persons---sometimes notifications get lost.
-->

<!-- Remove if not applicable -->

Fixes # (issue)


## Before submitting
- [ ] This PR fixes a typo or improves the docs (you can dismiss the
other checks if that's the case).
- [ ] Did you read the [contributor
guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#start-contributing-pull-requests),
      Pull Request section?
- [ ] Was this discussed/approved via a Github issue or the
[forum](https://discuss.huggingface.co/)? Please add a link
      to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes?
Here are the
[documentation
guidelines](https://github.com/huggingface/transformers/tree/main/docs),
and
[here are tips on formatting
docstrings](https://github.com/huggingface/transformers/tree/main/docs#writing-source-documentation).
- [ ] Did you write any new necessary tests?


## Who can review?

Anyone in the community is free to review the PR once the tests have
passed. Feel free to tag
members/contributors who may be interested in your PR.

<!-- Your PR will be replied to more quickly if you can figure out the
right person to tag with @


@OlivierDehaene OR @Narsil

 -->
2023-07-31 10:32:52 +02:00
Nicolas Patry f063ebde10
chore: migrate ci region for more availability. (#581) 2023-07-12 10:01:01 +02:00
OlivierDehaene e3e487dc71
feat(server): support trust_remote_code (#363) 2023-05-23 20:40:39 +02:00
OlivierDehaene 5f67923cac
feat: add nightly load testing (#358) 2023-05-23 17:42:19 +02:00
oOraph 0a6494785c
fix(ci): fix security group (#359)
# What does this PR do?
Switch security group used for ci
(open outbound rules)

Signed-off-by: Raphael <oOraph@users.noreply.github.com>
Co-authored-by: Raphael <oOraph@users.noreply.github.com>
2023-05-23 16:49:11 +02:00
OlivierDehaene 5a58226130
fix(server): fix decode token (#334)
Fixes #333

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2023-05-16 23:23:27 +02:00
OlivierDehaene dbdc587ddd
feat(integration-tests): improve comparison and health checks (#336) 2023-05-16 20:22:11 +02:00
OlivierDehaene e71471bec9
feat: add snapshot testing (#282) 2023-05-15 23:36:30 +02:00
OlivierDehaene 66b277321d
feat(ci): custom gpu runners (#328) 2023-05-15 15:53:08 +02:00
Nicolas Patry 411b0d4e1f
chore(github): add templates (#264) 2023-05-02 15:43:19 +02:00
Ehsan M. Kermani f092ba9b22
feat(server): add watermarking tests (#248) 2023-04-27 19:16:35 +02:00
Nicolas Patry db2b4e0754
feat(router): new healthcheck that skips the queue (#244)
Co-authored-by: OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
2023-04-26 20:23:54 +02:00
Nicolas Patry c4fb09f2ae
feat(router): add tests to validation (#237) 2023-04-26 16:14:40 +02:00
OlivierDehaene 274513e6a3
fix(ci): fix sha in docker image (#212) 2023-04-20 18:50:47 +02:00
OlivierDehaene 709d8936f6
feat(router): drop requests when client closes the channel (#202) 2023-04-20 11:07:40 +02:00
OlivierDehaene b6ee0ec7b0
feat(router): add git sha to info route (#208) 2023-04-19 21:36:59 +02:00