Auto Scaling
Autoscaling refers to increasing or decreasing the number of instances according to predefined rules. For example, if there are more than 1,000 requests in one hour, you can increase the number of instances by 2.
Adding Autoscaling Conditions
You can select "Add scale condition" to create a new autoscaling condition.
Scale Conditions
Here are the available conditions you can choose from:
Metric
Select the metric that will determine the increase or decrease of instances.
You can choose from total requests, requests per second, or latency in milliseconds.
Target Value and Operator
Compare the metric value with the target value using the operator.
For example, for the total requests metric, you can select "Greater than or equal to" for a target value of 50.
Measurement Period
The period for fetching the metric value.
For example, for the total requests metric, if the measurement period is 10 minutes, it will aggregate the total number of requests in the last 10 minutes.
Cooldown
The amount of time after autoscaling has occurred during which no additional actions will be taken.
You can set this value to limit how frequently autoscaling occurs.
Action
Select the desired action, such as increasing or decreasing the number of instances.
For example, if you select "Decrease instances" with a count of 1, when the specified metric condition is met, one instance will be decreased.
Autoscaling History
This is a record of when the specified autoscaling conditions were triggered. You can check when the conditions were met.
Last updated