From 08b39d1ae24e1ddbe53425b8b58f7fa3a15e22f9 Mon Sep 17 00:00:00 2001
From: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
Date: Fri, 20 Dec 2024 11:20:51 +0100
Subject: [PATCH] Fix `docker run` in `README.md`

---
 README.md | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 6d3a9b12..2e507c5d 100644
--- a/README.md
+++ b/README.md
@@ -83,8 +83,7 @@ model=HuggingFaceH4/zephyr-7b-beta
 # share a volume with the Docker container to avoid downloading weights every run
 volume=$PWD/data
 
-docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data \
-3.0.0 ghcr.io/huggingface/text-generation-inference:3.0.0 --model-id $model
+docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:3.0.0 --model-id $model
 ```
 
 And then you can make requests like