Auto Scaling

Autoscaling refers to increasing or decreasing the number of instances according to predefined rules. For example, if there are more than 1,000 requests in one hour, you can increase the number of instances by 2.

Adding Autoscaling Conditions

You can select "Add scale condition" to create a new autoscaling condition.

Scale Conditions

Here are the available conditions you can choose from:

  • Metric

    • Select the metric that will determine the increase or decrease of instances.

    • You can choose from total requests, requests per second, or latency in milliseconds.

  • Target Value and Operator

    • Compare the metric value with the target value using the operator.

    • For example, for the total requests metric, you can select "Greater than or equal to" for a target value of 50.

  • Measurement Period

    • The period for fetching the metric value.

    • For example, for the total requests metric, if the measurement period is 10 minutes, it will aggregate the total number of requests in the last 10 minutes.

  • Cooldown

    • The amount of time after autoscaling has occurred during which no additional actions will be taken.

    • You can set this value to limit how frequently autoscaling occurs.

  • Action

    • Select the desired action, such as increasing or decreasing the number of instances.

    • For example, if you select "Decrease instances" with a count of 1, when the specified metric condition is met, one instance will be decreased.

Autoscaling History

This is a record of when the specified autoscaling conditions were triggered. You can check when the conditions were met.

Last updated