From 5062fda4ffba60fa0f22e69a31722fe6aaebabe3 Mon Sep 17 00:00:00 2001 From: Nicolas Patry Date: Fri, 5 Apr 2024 16:44:10 +0200 Subject: [PATCH] Push users to streaming in the readme. (#1698) --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 60fe83cd..bffe1e8a 100644 --- a/README.md +++ b/README.md @@ -82,7 +82,7 @@ docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingf And then you can make requests like ```bash -curl 127.0.0.1:8080/generate \ +curl 127.0.0.1:8080/generate_stream \ -X POST \ -d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":20}}' \ -H 'Content-Type: application/json'