hf_text-generation-inference/server
OlivierDehaene 3cf6368c77 feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00
..
text_generation feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00
.gitignore feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00
Makefile feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00
README.md feat(server): Use safetensors 2022-10-22 20:00:15 +02:00
poetry.lock feat(server): Support bitsandbytes 2022-10-27 14:25:29 +02:00
pyproject.toml feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00

README.md

BLOOM Inference Python gRPC Server

A Python gRPC server for BLOOM Inference

Install

make install

Run

make run-dev