Model Monitoring

Definition

Continuously tracking AI model performance, data quality, and system health in production to detect issues early and ensure reliability.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between Model Monitoring and Observability?
Model monitoring focuses specifically on ML behavior in production—things like prediction accuracy, data drift, bias signals, and feature quality. Observability is broader and covers the whole system (services, infrastructure, logs, traces). In practice, model monitoring is usually built on top of observability tools, adding ML-specific metrics and checks.
When should I use Model Monitoring?
Use it whenever an ML model is making production decisions that matter (revenue, safety, compliance, customer experience). It’s especially important when data changes over time (seasonality, new user behavior, new products), when you can’t easily see errors immediately, or when you need auditability for regulated use cases.
How much does Model Monitoring cost?
Cost depends on (1) how much data you log (features, predictions, labels), (2) how often you run monitoring jobs (real-time vs hourly/daily), (3) storage and retention, (4) compute for drift/quality calculations, and (5) alerting and dashboarding tools. Managed services (e.g., SageMaker Model Monitor or Vertex AI Model Monitoring) typically charge for monitoring compute and associated logging/storage; DIY approaches shift costs to your own compute, metrics, and log storage.

Category: ai-ml

Difficulty: intermediate

Related Terms

See Also