YouTube / Google
YouTube handles over 500 hours of video uploads every minute and serves billions of views daily. This GCP-native architecture separates the upload pipeline (chunked upload → transcoding via GKE → multiple resolutions in Cloud Storage) from the viewing pipeline (Cloud CDN → adaptive bitrate streaming). Aimed at engineers building video platforms that need adaptive bitrate streaming, automated moderation, and personalized recommendations.
Share this architecture with your network
Video uploads are chunked and stored in Cloud Storage before transcoding. Pub/Sub distributes transcoding jobs across GKE workers that auto-scale based on subscription backlog. Each video produces multiple resolution variants stored in Cloud Storage. Viewing traffic scales through Cloud CDN edge caching with adaptive bitrate manifests. Metadata queries hit Memorystore first, falling back to Cloud SQL read replicas.
Data Lake & Analytics Platform
Modern Data Stack
Web Crawler System
System Design Classic
Multi-Tenant SaaS Platform
Generic SaaS
Notification System
System Design Classic
Dropbox File Storage System
Dropbox
Pastebin System
System Design Classic
YouTube Video Streaming System
Remix this architecture in Canvas