azure
System Design
intermediate
Cloud file storage and sync

Dropbox File Storage System

Dropbox

Dropbox synchronizes billions of files across hundreds of millions of devices by chunking files into 4MB blocks and deduplicating at the block level, storing identical content only once. Delta sync transmits only changed blocks, dramatically reducing bandwidth while maintaining consistency across all connected devices. Ideal for teams building cloud file sync products that need block-level deduplication, delta transfers, and cross-device consistency.

Data Flow

Sync API
Sync Queue
Upload Service
Sync Service
Block Storage
File Metadata DB
Folder Cache
Dedup Checker

Share this architecture with your network

Service Breakdown (8 services)

Other8 services
Sync API
  • Exposes backend services through managed API endpoints
  • Enforces authentication, throttling, and quotas
  • Provides developer portal and API analytics
Upload Service
  • Validates and ingests video uploads from creators
  • Extracts metadata and generates processing tasks
  • Enforces content size and format restrictions
Sync Service
  • Detects file changes and synchronizes across devices
  • Resolves conflicts with last-writer-wins strategy
  • Minimizes bandwidth with delta-based sync
Block Storage
  • Stores file chunks as content-addressed blocks
  • Deduplicates identical blocks across all users
  • Serves block retrieval with high read throughput
File Metadata DB
  • Runs managed SQL databases with built-in intelligence
  • Supports elastic pools for multi-database efficiency
  • Handles automated patching and geo-replication
Folder Cache
  • Caches frequently accessed data in-memory
  • Reduces database round-trips and latency
  • Supports TTL-based expiration policies
Sync Queue
  • Provides reliable enterprise message brokering
  • Supports topics and queue-based messaging
  • Guarantees at-least-once delivery
Dedup Checker
  • Executes event-driven functions without managing servers
  • Scales based on event volume with consumption billing
  • Supports durable functions for stateful workflows

Scaling Strategy

File blocks are stored in Azure Blob Storage with content-addressed naming (SHA-256 hash) for automatic deduplication. The metadata service uses Azure SQL for ACID-compliant file tree operations with Redis Cache for hot folder lookups. Upload and sync notifications flow through Service Bus, and Azure Functions processes deduplication checks. The sync protocol scales by only transmitting block diffs rather than full files.

Related Architectures