Elasticity
Definition
The ability to dynamically and automatically adjust cloud resources up or down based on actual demand in real-time, ensuring optimal performance.
Use Cases
- Netflix: Handling highly variable streaming demand across regions and time zones — Runs services on AWS and uses Auto Scaling groups to adjust the number of EC2 instances based on demand signals (for example, load and service-level metrics), increasing capacity during peak viewing hours and reducing it during off-peak periods. (Maintains service availability during traffic surges while reducing unnecessary spend during quieter periods by scaling down unused capacity.)
- Airbnb: Managing traffic spikes during major events and seasonal travel peaks — Uses cloud-based scaling patterns (including automated scaling of application tiers) so additional compute capacity can be added when request volume increases and removed when it normalizes. (Improves reliability during sudden demand increases and helps control costs by avoiding permanently over-provisioned infrastructure.)
- Spotify: Supporting fluctuating user activity (new releases, playlists, regional peaks) — Uses cloud infrastructure with automated scaling for services so capacity can expand during high-traffic periods and contract afterward, aligning resources with real usage. (Delivers consistent performance during peak usage while optimizing infrastructure utilization and cost efficiency.)
Provider Equivalents
- AWS: AWS Auto Scaling
- Azure: Azure Autoscale
- GCP: Google Cloud Autoscaler
- OCI: OCI Autoscaling
Frequently Asked Questions
- What's the difference between elasticity and scalability?
- Scalability is the ability of a system to handle growth (more users, more data) by adding resources. Elasticity is how quickly and automatically the system can add or remove those resources to match demand right now. In practice: scalability is the capability; elasticity is the on-demand, often automated execution of that capability.
- When should I use elasticity?
- Use elasticity when your workload demand changes over time and you want to avoid paying for idle capacity. Common cases include web apps with unpredictable traffic, e-commerce during promotions, APIs with bursty usage, batch processing with variable job sizes, and event-driven systems where load can spike suddenly.
- How much does elasticity cost?
- Elasticity itself is usually not a separate line-item cost; you typically pay for the resources you scale to (compute instances, containers, load balancers, storage, and data transfer). Costs depend on peak size, how long you stay at peak, scaling frequency, and whether you use on-demand, reserved/committed use, or spot/preemptible capacity. Elasticity can reduce total cost by scaling down during low demand, but it can increase cost if scaling policies are too aggressive or if traffic spikes are sustained.
Category: cloud
Difficulty: intermediate
Related Terms
See Also