System Design Classic
A rate limiter protects APIs from abuse and ensures fair resource allocation across clients. This Azure-native design implements multiple algorithms — token bucket for smooth rate limiting, sliding window log for precise counting, and fixed window counter for simplicity. The distributed implementation uses Azure Redis Cache for atomic counter operations across multiple API Management instances, with per-client and per-endpoint configurable limits.
Share this architecture with your network
Azure Redis Cache provides atomic increment operations for counter-based rate limiting across all API Management instances. Rate limit rules are stored in Cosmos DB and cached locally with short TTLs. The token bucket algorithm runs as an Azure Functions policy at the API Management layer, adding zero latency for requests within limits. Azure Monitor tracks rate limit hits for alerting and capacity planning.
Pastebin System
System Design Classic
Multi-Tenant SaaS Platform
Generic SaaS
Data Lake & Analytics Platform
Modern Data Stack
YouTube Video Streaming System
YouTube / Google
Notification System
System Design Classic
Dropbox File Storage System
Dropbox
Rate Limiter System
Remix this architecture in Canvas