Automatically scale a Virtual Machine Scale Set in the Azure portal

Article
08/22/2024

When you create a scale set, you define the number of VM instances that you wish to run. As your application demand changes, you can automatically increase or decrease the number of VM instances. The ability to autoscale lets you keep up with customer demand or respond to application performance changes throughout the lifecycle of your app.

This article shows you how to create autoscale rules in the Azure portal that monitor the performance of the VM instances in your scale set. These autoscale rules increase or decrease the number of VM instances in response to these performance metrics. You can also complete these steps with Azure PowerShell or the Azure CLI.

Prerequisites

To create autoscale rules, you need an existing Virtual Machine Scale Set. You can create a scale set with the Azure portal, Azure PowerShell, or Azure CLI.

Create a rule to automatically scale out

If your application demand increases, the load on the VM instances in your scale set increases. If this increased load is consistent, rather than just a brief demand, you can configure autoscale rules to increase the number of VM instances in the scale set. When these VM instances are created and your applications are deployed, the scale set starts to distribute traffic to them through the load balancer. You control what metrics to monitor, such as CPU or disk, how long the application load must meet a given threshold, and how many VM instances to add to the scale set.

Open the Azure portal and select Resource groups from the menu on the left-hand side of the dashboard.
Select the resource group that contains your scale set, then choose your scale set from the list of resources.
Choose Scaling from the menu on the left-hand side of the scale set window. Select the button to Custom autoscale:
Select the option to Add a rule.

Let's create a rule that increases the number of VM instances in a scale set when the average CPU load is greater than 70% over a 10-minute period. When the rule triggers, the number of VM instances is increased by 20%. In scale sets with a small number of VM instances, you could set the Operation to Increase count by and then specify 1 or 2 for the Instance count. In scale sets with a large number of VM instances, an increase of 10% or 20% VM instances may be more appropriate.

Specify the following settings for your rule:

Parameter	Explanation	Value
Time Aggregation	Defines how the collected metrics should be aggregated for analysis.	Average
Metric Name	The performance metric to monitor and apply scale set actions on.	Percentage CPU
Time grain statistic	Defines how the collected metrics in each time grain should be aggregated for analysis.	Average
Operator	Operator used to compare the metric data against the threshold.	Greater than
Threshold	The percentage that causes the autoscale rule to trigger an action.	70
Duration	The amount of time monitored before the metric and threshold values are compared. Does not include cool down period.	10 minutes
Operation	Defines if the scale set should scale up or down when the rule applies and by what increment.	Increase percent by
Instance count	The percentage of VM instances should be changed when the rule triggers.	20
Cool down (minutes)	The amount of time to wait before the rule is applied again so that the autoscale actions have time to take effect.	5 minutes

The following examples show a rule created in the Azure portal that matches these settings:

Create an autoscale rule to increase the number of VM instances

Note

Tasks running inside the instance will abruptly stop and the instance will scale down once it completes the cooling period.

To create the rule, select Add

Create a rule to automatically scale in

On an evening or weekend, your application demand may decrease. If this decreased load is consistent over a period of time, you can configure autoscale rules to decrease the number of VM instances in the scale set. This scale-in action reduces the cost to run your scale set as you only run the number of instances required to meet the current demand.

Choose to Add a rule again.

Create a rule that decreases the number of VM instances in a scale set when the average CPU load then drops below 30% over a 10-minute period. When the rule triggers, the number of VM instances is decreased by 20%.

Use the same approach as with the previous rule. Adjust the following settings for your rule:

Parameter	Explanation	Value
Operator	Operator used to compare the metric data against the threshold.	Less than
Threshold	The percentage that causes the autoscale rule to trigger an action.	30
Operation	Defines if the scale set should scale up or down when the rule applies and by what increment	Decrease percent by
Instance count	The percentage of VM instances should be changed when the rule triggers.	20

To create the rule, select Add

Define autoscale instance limits

Your autoscale profile must define a minimum, maximum, and default number of VM instances. When your autoscale rules are applied, these instance limits make sure that you do not scale out beyond the maximum number of instances, or scale in beyond the minimum of instances.

Set the following instance limits:

Minimum Maximum Default

2 10 2
To apply your autoscale rules and instance limits, select Save.

Minimum	Maximum	Default
2	10	2

Monitor number of instances in a scale set

To see the number and status of VM instances, select Instances from the menu on the left-hand side of the scale set window. The status indicates if the VM instance is Creating as the scale set automatically scales out, or is Deleting as the scale automatically scales in.

View a list of scale set VM instances

Autoscale based on a schedule

The previous examples automatically scaled a scale set in or out with basic host metrics such as CPU usage. You can also create autoscale rules based on schedules. These schedule-based rules allow you to automatically scale out the number of VM instances ahead of an anticipated increase in application demand, such as core work hours, and then automatically scale in the number of instances at a time that you anticipate less demand, such as the weekend.

Choose Scaling from the menu on the left-hand side of the scale set window.
Choose to Add a scale condition. Select the pencil icon next to rule name, and provide a name such as Scale out during each work day.
Select the radio button to Scale to a specific instance count.
To scale up the number of instances, enter 10 as the instance count.
Choose Repeat specific days for the Schedule type.
Select all the work days, Monday through Friday.
Choose the appropriate timezone, then specify a Start time of 09:00.
Choose to Add a scale condition again. Repeat the process to create a schedule named Scale in during the evening that scales to 3 instances, repeats every weekday, and starts at 18:00.
To apply your schedule-based autoscale rules, select Save.

To see how your autoscale rules are applied, select Run history across the top of the Scaling window. The graph and events list shows when the autoscale rules trigger and the number of VM instances in your scale set increases or decreases.

Next steps

In this article, you learned how to use autoscale rules to scale horizontally and increase or decrease the number of VM instances in your scale set. For information on how to manage your VM instances, see Manage Virtual Machine Scale Sets with Azure PowerShell.

To learn how to generate alerts when your autoscale rules trigger, see Use autoscale actions to send email and webhook alert notifications in Azure Monitor. You can also Use audit logs to send email and webhook alert notifications in Azure Monitor.

Share via