Experiment Tracking

Definition

Recording and comparing different machine learning experiments, including their configurations, metrics, and results for informed decision-making.

Use Cases

Provider Equivalents

Frequently Asked Questions

What's the difference between experiment tracking and model registry?
Experiment tracking records what happened during training (parameters, code version, metrics, artifacts) so you can compare runs and reproduce results. A model registry is for managing the lifecycle of selected models after training—versioning them, adding approvals, and promoting them to staging or production.
When should I use experiment tracking?
Use it as soon as you run more than a few training attempts or need reproducibility. It’s especially useful when you tune hyperparameters, compare different feature sets, run A/B model candidates, collaborate across a team, or must audit how a model was produced for compliance.
How much does experiment tracking cost?
Costs depend on (1) the tracking service pricing model and (2) the storage/compute you use. Many platforms include basic experiment tracking features as part of their managed ML service, but you still pay for training compute, artifact storage (e.g., object storage for models), and sometimes metadata storage or managed workspace fees. Self-hosting MLflow can reduce direct service fees but shifts cost to your infrastructure and operations.

Category: ai-ml

Difficulty: intermediate

Related Terms

See Also