Scalability

Definition

The ability to automatically get more computing power when needed and less when not needed, optimizing resource use and cost efficiency.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between scalability and elasticity?
Scalability is the ability of a system to handle growth by adding resources (more servers, bigger servers, or both). Elasticity is a type of scalability where capacity changes automatically and quickly to match demand (scale up during spikes, scale down after).
When should I use scalability?
Use scalability when your workload demand changes over time (daily peaks, seasonal traffic, marketing campaigns) or when you expect growth. It’s especially useful for web apps, APIs, e-commerce, streaming, and data processing jobs where you want to avoid outages during spikes and avoid paying for idle capacity during slow periods.
How much does scalability cost?
Scalability itself is usually enabled by services that may have little or no direct fee, but you pay for the resources that get added (compute instances, containers, database capacity, load balancers, and network egress). Costs depend on scaling frequency, peak size, how long you stay at peak, instance types, and whether you use cost optimizations like reserved capacity, savings plans/commitments, or spot/preemptible instances.

Category: cloud

Difficulty: basic

Related Terms

See Also