What is the latency and cost of A100 gpu
Hi team, I want to deploy fine-tuned model on Azure. If I use llama3.1-8b model, the input is 4k token and output is 512 token. What is the latency if I deploy the model to A100 gpu ? What is the economy cost for each A100 gpu ? Do we support vllm…
How to do A/B test on Azure
Hi Team, I want to deploy fine-tuned Qwen2.5B model on Azure, how do I do A/B test or online experiment for model version A and model version B ?
(new)AML Continuous Monitoring | Mapping Ground Truth Data for Model Performance Monitoring in Azure Machine Learning
This is the new post as my first post couldn't be viewed( shown 404 error after I clicked it) Greetings, I am reading the document regarding MLOps and would like to continuously monitor the ML model deployed inside Azure Machine Learning. The goal of…
Issue with the sandbox being unavailable on Microsoft Azure Fundamentals
Hey everyone, I’ve been trying to use the MS Learn Sandbox for a module, but I keep running into this annoying error message: “The sandbox for this module is currently unavailable. We're working to resolve this as quickly as possible. In the meantime,…
Feature request for Azure ML online endpoints
Hi, I use AML online endpoints a lot. The "test" panel is very helpful. However, the data in the test request gets zeroed out when I navigate off this page. I currently duplicate the browser tab and navigate to the "logs" panel so I…
How to deploy an LLM in a batch endpoint in Azure Machine Learning? Any alternatives?
Hi, I'm trying to deploy an LLM in Azure Machine Learning in a batch endpoint. I was successful in creating an online endpoint, but facing issues with batch endpoint. The goal of the project is to deploy an LLM that will run every morning and produce…
AZURE ML STUDIO (FACING PROBLEM IN DEPLOYMENT)
Not enough quota available for Standard_D2as_v4. Current usage/limit: 0/4. Additional requested: 8 (20% more quota than expected may be needed for redundancy purposes on some SKUs) I am facing a problem while deploying model in selecting Virtual machine…
Automl Model Deploy Fails On Gunicorn Version Conflict
Hello, I am attempting to deploy an automl model to a real time endpoint, when I do I am getting a pip error about dependency conflicts between gunicorn and mlflow. It seems these install are dynamically added by azure? What can I do to get past this…
Azure Machine Learning Studio creates v1 deployments that can't be tested in Studio
If you use AutoML in Azure Machine Learning Studio to train a model and then deploy that model to an Azure Container Instance, you can't test the deployment using Studio. When you click on the Test tab, it comes back with "This deployment is based…
Not able to subscribe LLama-3.3-70B-Instruct with Microsoft Azure Sponsorship Account
Hello Team, Greetings of the day!! I would like to inform you that one of my customers is not able to subscribe LLama-3.3-70B-Instruct with Microsoft Azure Sponsorship Account. He attempted to subscribe to the LLama-3.3-70B-Instruct via both the…
what is difference between AKS and machine learning compute
Azure offers managed compute resources, including Azure Kubernetes Service (AKS) or Azure Machine Learning Compute. I want to upload fine-tuned LLM like Qwen2.5 or llama3.1 to Azure and online serving with these fine-tuned models, what is difference…
"Error 'Resource provider not registered' when deploying model in Azure ML Tutorial (Free Trial)"
Hi, I'm following the Microsoft Learn tutorial "Create a classification model with Azure Machine Learning designer" (https://microsoftlearning.github.io/mslearn-ai-fundamentals/Instructions/Labs/01-machine-learning.html) with an Azure Free…
Private Azure Machine Learning Workspace getting forbidden to reach private Storage Account for SAS
My Azure Machine learning workspace is configured with private endpoint , my aml is configured to a Storage account which is also configured private endpoint and disabled the public access, My Storage Account network will looks like below The…
How to rerun successful AzureML Pipeline
I have a successful pipeline that writes files to Blob storage. If I delete the files in blob storage and try to rerun the pipeline, the pipeline run is cached so the pipeline doesn't actually rerun. Is there a way to force rerun the pipeline?
I am trying to schedule my azure ml command job. It runs fine the first time but from second occurrence, it directly goes into completed state!
I am trying to schedule a command job using bicep. It gets executed as expected the first time but then from 2nd time the job directly goes into completed state. resource jobResource 'Microsoft.MachineLearningServices/workspaces/schedules@2023-10-01' =…
Why can't I select Standard_NC6s_v3 as my endpoint Virtual Machine?
I'm trying to deploy my endpoint, and I wish to select Standard_NC6s_v3 as my VM, but the issue is that it only allows me to select up to these VMs. The NC6s falls into the following category (not enough quota). At first, I thought that it was related…
How to do the load balancer, tracking event and log
Hi Team, I want to deploy my fine-tuned LLM like Qwen2.5 model to Azure: How do you do the load balancer between different host or gpus ? How do you record the tracking and log on the Azure disk space ?
AzureMLException: Error in Streaming from Input Data Sources
An AzureMLException is encountered with the message "Error Code: ScriptExecution.StreamAccess.Unexpected Native Error: error in streaming from input data sources" when using the 'Apply SQL Transformation' component in the Azure ML designer…
In Azure ML studio deploy option is not there
Hi team im new to Azure ML studio in that i done trained data but i would like to deploy im in trial account i dont see option for deploy i can able to see only submit , share like that only please help