
Rate Limiter System


A rate limiter protects APIs from abuse and ensures fair resource allocation across clients. This Azure-native design implements multiple algorithms — token bucket for smooth rate limiting, sliding window log for precise counting, and fixed window counter for simplicity. The distributed implementation uses Azure Redis Cache for atomic counter operations across multiple API Management instances, with per-client and per-endpoint configurable limits.
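The token bucket algorithm mentioned above can be sketched as follows. This is an illustrative, in-process version (class and parameter names are my own, not part of the Azure design): a bucket holds up to `capacity` tokens, refills at `rate` tokens per second, and each request consumes one token, so short bursts are absorbed while the long-run rate stays bounded.

```python
import time

class TokenBucket:
    """Illustrative token bucket rate limiter (single-process sketch)."""

    def __init__(self, capacity: int, rate: float):
        self.capacity = capacity        # maximum burst size, in tokens
        self.rate = rate                # refill rate, tokens per second
        self.tokens = float(capacity)   # bucket starts full
        self.updated = time.monotonic() # time of last refill

    def allow(self) -> bool:
        now = time.monotonic()
        # Credit tokens accrued since the last check, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= 1:
            self.tokens -= 1  # spend one token for this request
            return True
        return False          # bucket empty: reject or throttle
```

A bucket with `capacity=100` and `rate=10` admits a burst of 100 requests, then sustains 10 requests per second, which is why the token bucket is described as "smooth" rate limiting.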

Data Flow

API Management → Rate Limit Checker → Counter Store (Redis)
Rate Limit Rules (limit configuration read by the checker)
Limit Metrics (throttling telemetry emitted by the checker)


Service Breakdown (5 services)

API Management
  • Exposes backend services through managed API endpoints
  • Enforces authentication, throttling, and quotas
  • Provides developer portal and API analytics
Rate Limit Checker (Azure Functions)
  • Executes event-driven functions without managing servers
  • Scales based on event volume with consumption billing
  • Supports durable functions for stateful workflows
Counter Store (Redis)
  • Caches frequently accessed data in-memory
  • Reduces database round-trips and latency
  • Supports TTL-based expiration policies
Rate Limit Rules (Cosmos DB)
  • Provides globally distributed multi-model database
  • Guarantees single-digit ms reads worldwide
  • Supports five consistency levels
Limit Metrics (Azure Monitor)
  • Tracks API call rates and quota consumption
  • Emits alerts when rate limits are approached
  • Provides dashboards for throttling visibility

Scaling Strategy

Azure Redis Cache provides atomic increment operations for counter-based rate limiting shared across all API Management instances. Rate limit rules are stored in Cosmos DB and cached locally with short TTLs, so stale limits expire quickly after a rule change. The token bucket check runs in Azure Functions, invoked from an API Management policy, and adds negligible latency for requests within their limits. Azure Monitor tracks rate-limit hits for alerting and capacity planning.
