Overfitting

Definition

When a machine learning model learns the training data too well, including its noise and quirks, making it perform poorly on new data.

Use Cases

Frequently Asked Questions

What's the difference between overfitting and underfitting?
Overfitting happens when a model learns the training data too specifically (including noise), so it performs well on training data but poorly on new data. Underfitting is the opposite: the model is too simple or not trained enough, so it performs poorly on both training and new data.
When should I worry about overfitting?
Worry about overfitting when your training metrics are much better than your validation/test metrics, when you have a small dataset, when your model is very complex (e.g., deep networks, high-degree polynomials), or when you have many features compared to the number of examples. In production, a sudden drop in real-world accuracy compared to offline results can also indicate overfitting or data drift.
How much does overfitting cost?
Overfitting doesn’t have a direct price tag, but it increases costs through wasted training time (more experiments and longer training), higher cloud compute spend (GPUs/TPUs/CPUs for repeated runs), extra data labeling/collection, and business losses from wrong predictions (e.g., false fraud alerts, missed fraud, poor recommendations). The biggest cost drivers are model complexity, number of training runs, dataset size, and the need for ongoing monitoring and retraining.

Category: ai-ml

Difficulty: intermediate

Related Terms

See Also