...
There are few auto scaling decision making variables, out of which all of the vital variables are configurable using the loadbalancer.conf
file. (For sample configuration files please see Sample Autoscaling Configurations)
- autoscaler_task_interval (t)
This refers to the time period between two iterations of an 'autoscaling decision making task'. When configuring this value, you are advised to consider the time ‘that a service instance takes to join Elastic Load Balancer (ELB)’. This variable is stated milliseconds and the default value is 30000ms. - max_requests_per_second (Rps)
This refers to the number of requests, a service instance can withstand per second. It is recommended that you determine this value for each service instance and for different scenarios. The ideal way to estimate this value could be by load testing a similar service instance. The default value is 100. - rounds_to_average (r)
This refers to the number of ‘autoscaling decision making’ task iterations that need to take place before an autoscaling decision is made. The default value is 10. - alarming_upper_rate (AUR)
Instead of waiting till the service instance reaches its maximum request capacity (where the alarming_upper_rate = 1), the system will be scaled up when it reaches the request capacity which corresponds to alarming_upper_rate. This value should be 0<AUR<=1. The default value is 0.7. - alarming_lower_rate (ALR)
This refers to the lower bound of the alarming rate. This variable an be used as an indicator to decide when the system should be scaled down. This value should be 0<ALR<=1. The default value is 0.2. - scale_down_factor (SDF)
This refers to the factor that is needed to slow down the scaling down process. To reduce the scaling down due to a false-positive events, the scaling do wn process needs to be carried out slowly. This value should be 0<SDF<=1. The default value is 0.25.
...