hf_text-generation-inference/aml
OlivierDehaene ab2ad91da3
fix(docker): fix api-inference deployment (#30)
2023-01-23 17:33:08 +01:00
..
README.md Update aml deployment 2022-10-17 10:39:59 +02:00
deployment.yaml fix(docker): fix api-inference deployment (#30) 2023-01-23 17:33:08 +01:00
endpoint.yaml Update aml deployment 2022-10-17 10:39:59 +02:00
model.yaml feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00

README.md

docker build . -t db4c2190dd824d1f950f5d1555fbadf0.azurecr.io/text-generation:0.1
docker push db4c2190dd824d1f950f5d1555fbadf0.azurecr.io/text-generation:0.1

az ml model create -f model.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
az ml online-endpoint create -f endpoint.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
az ml online-deployment create -f deployment.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace