preemo_text-generation-infe.../server
OlivierDehaene ce960be0a5
feat(bloom): use torch.nn.Linear and torch.nn.GELU (#33)
2023-01-26 15:33:45 +01:00
..
tests fix(server): Fix position ids (#28) 2023-01-20 15:35:22 +01:00
text_generation feat(bloom): use torch.nn.Linear and torch.nn.GELU (#33) 2023-01-26 15:33:45 +01:00
.gitignore feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00
Makefile fix(dockerfile): fix docker build (#32) 2023-01-24 19:52:39 +01:00
README.md feat(server): Use safetensors 2022-10-22 20:00:15 +02:00
poetry.lock feat(launcher): Log server stdout (#19) 2023-01-05 12:01:23 +01:00
pyproject.toml feat(launcher): Log server stdout (#19) 2023-01-05 12:01:23 +01:00

README.md

BLOOM Inference Python gRPC Server

A Python gRPC server for BLOOM Inference

Install

make install

Run

make run-dev