preemo_text-generation-infe.../aml
OlivierDehaene 20c3c5940c
feat(router): refactor API and add openAPI schemas (#53)
2023-02-03 12:43:37 +01:00
..
README.md Update aml deployment 2022-10-17 10:39:59 +02:00
deployment.yaml feat(router): refactor API and add openAPI schemas (#53) 2023-02-03 12:43:37 +01:00
endpoint.yaml Update aml deployment 2022-10-17 10:39:59 +02:00
model.yaml feat(server): Support all AutoModelForCausalLM on a best effort basis 2022-10-28 19:24:00 +02:00

README.md

docker build . -t db4c2190dd824d1f950f5d1555fbadf0.azurecr.io/text-generation:0.1
docker push db4c2190dd824d1f950f5d1555fbadf0.azurecr.io/text-generation:0.1

az ml model create -f model.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
az ml online-endpoint create -f endpoint.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace
az ml online-deployment create -f deployment.yaml -g HuggingFace-BLOOM-ModelPage -w HuggingFace