With Application Auto Scaling, you can configure automatic scaling for your scalable resources. You can use Application Auto Scaling to accomplish the following tasks:
Define scaling policies to automatically scale your AWS or custom resources
Scale your resources in response to CloudWatch alarms
Schedule one-time or recurring scaling actions
View the history of your scaling events
Application Auto Scaling can scale the following resources:
Amazon ECS services. For more information, see Service Auto Scaling in the Amazon Elastic Container Service Developer Guide.
Amazon EC2 Spot fleets. For more information, see Automatic Scaling for Spot Fleet in the Amazon EC2 User Guide.
Amazon EMR clusters. For more information, see Using Automatic Scaling in Amazon EMR in the Amazon EMR Management Guide.
AppStream 2.0 fleets. For more information, see Fleet Auto Scaling for Amazon AppStream 2.0 in the Amazon AppStream 2.0 Developer Guide.
Provisioned read and write capacity for Amazon DynamoDB tables and global secondary indexes. For more information, see Managing Throughput Capacity Automatically with DynamoDB Auto Scaling in the Amazon DynamoDB Developer Guide.
Amazon Aurora Replicas. For more information, see Using Amazon Aurora Auto Scaling with Aurora Replicas.
Amazon SageMaker endpoint variants. For more information, see Automatically Scaling Amazon SageMaker Models.
Custom resources provided by your own applications or services. More information is available in our GitHub repository.
To learn more about Application Auto Scaling, including information about granting IAM users required permissions for Application Auto Scaling actions, see the Application Auto Scaling User Guide.
Represents a CloudWatch alarm associated with a scaling policy.
A client for the Application Auto Scaling API.
Represents a CloudWatch metric of your choosing for a target tracking scaling policy to use with Application Auto Scaling.
To create your customized metric specification:
For more information about CloudWatch, see Amazon CloudWatch Concepts.
Describes the dimension names and values associated with a metric.
Represents a predefined metric for a target tracking scaling policy to use with Application Auto Scaling.
Represents a scalable target.
Represents the minimum and maximum capacity for a scheduled action.
Represents a scaling activity.
Represents a scaling policy to use with Application Auto Scaling.
Represents a scheduled action.
Represents a step adjustment for a StepScalingPolicyConfiguration. Describes an adjustment based on the difference between the value of the aggregated CloudWatch metric and the breach threshold that you've defined for the alarm.
For the following examples, suppose that you have an alarm with a breach threshold of 50:
There are a few rules for the step adjustments for your step policy:
Represents a step scaling policy configuration to use with Application Auto Scaling.
Represents a target tracking scaling policy configuration to use with Application Auto Scaling.
Errors returned by DeleteScalingPolicy
Errors returned by DeleteScheduledAction
Errors returned by DeregisterScalableTarget
Errors returned by DescribeScalableTargets
Errors returned by DescribeScalingActivities
Errors returned by DescribeScalingPolicies
Errors returned by DescribeScheduledActions
Errors returned by PutScalingPolicy
Errors returned by PutScheduledAction
Errors returned by RegisterScalableTarget
Trait representing the capabilities of the Application Auto Scaling API. Application Auto Scaling clients implement this trait.