The ability to automatically add computing capacity when demand rises and release it when demand falls. Like a restaurant that can instantly add tables during busy times and remove them when it is quiet.
A website uses scalability to handle millions of visitors during Black Friday sales without crashing, then scales back afterward.
All four services automatically adjust compute capacity (scale out/in or up/down) based on metrics like CPU, memory (where supported), or custom signals to match demand and control cost.
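To make the idea of metric-driven scaling concrete, here is a minimal sketch of a proportional scaling rule. It is not the algorithm of any particular service. The function name `desired_capacity`, its parameters, and the default limits are hypothetical and chosen only for illustration. The rule resizes the fleet so that the average metric (for example, CPU percent) lands near a target value, then clamps the result to a minimum and maximum size.

```python
import math


def desired_capacity(current: int, metric: float, target: float,
                     min_size: int = 1, max_size: int = 10) -> int:
    """Return the instance count that would bring the average metric near target.

    Illustrative only: the names and default limits are assumptions,
    not taken from any real autoscaling service.
    """
    if current < 1:
        raise ValueError("current must be at least 1")
    if target <= 0:
        raise ValueError("target must be positive")
    # If each instance runs at `metric` but should run at `target`,
    # the fleet needs to grow or shrink by the ratio metric / target.
    raw = math.ceil(current * metric / target)
    # Clamp to the configured bounds: the upper bound caps cost,
    # the lower bound keeps the service available.
    return max(min_size, min(max_size, raw))


if __name__ == "__main__":
    print(desired_capacity(4, 90.0, 60.0))  # busy period: scale out
    print(desired_capacity(4, 15.0, 60.0))  # quiet period: scale in
```

The upper bound is what keeps cost under control during a spike, and the lower bound keeps at least some capacity running when traffic is quiet.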