Evaluation Metrics


Evaluation Metrics are quantitative measures used to assess the performance of machine learning models. Choosing the right metric is essential for understanding how well a model performs, especially in classification and regression problems.

Why Are Evaluation Metrics Important?

  • Provide objective criteria to compare different models.
  • Help detect issues like overfitting or underfitting.
  • Guide model improvement and selection.
  • Reflect the business or real-world importance of model predictions.

Types of Evaluation Metrics

1. Classification Metrics

These metrics evaluate models that predict discrete categories (classes).

  • Accuracy: Proportion of correct predictions over total predictions.
   Accuracy = (TP + TN) / (TP + TN + FP + FN)
  • Precision: Proportion of true positives among all predicted positives.
   Precision = TP / (TP + FP)
  • Recall (Sensitivity): Proportion of true positives among all actual positives.
   Recall = TP / (TP + FN)
  • F1 Score: Harmonic mean of precision and recall, balancing both.
   F1 = 2 × (Precision × Recall) / (Precision + Recall)
  • Specificity: Proportion of true negatives among all actual negatives.
   Specificity = TN / (TN + FP)
  • ROC AUC: Area under the Receiver Operating Characteristic curve, measuring true positive rate vs false positive rate across thresholds.
  • Precision-Recall AUC (AUPRC): Area under the Precision-Recall curve, especially useful for imbalanced data.
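The count-based metrics above can be sketched directly from the four confusion-matrix cells. The counts below are hypothetical values chosen for illustration:

```python
# Confusion-matrix cells (hypothetical counts for illustration)
TP, TN, FP, FN = 80, 90, 10, 20

accuracy = (TP + TN) / (TP + TN + FP + FN)           # 170 / 200 = 0.85
precision = TP / (TP + FP)                           # 80 / 90  ≈ 0.889
recall = TP / (TP + FN)                              # 80 / 100 = 0.80
specificity = TN / (TN + FP)                         # 90 / 100 = 0.90
f1 = 2 * precision * recall / (precision + recall)   # harmonic mean ≈ 0.842

print(f"accuracy={accuracy:.3f} precision={precision:.3f} "
      f"recall={recall:.3f} specificity={specificity:.3f} f1={f1:.3f}")
```

Note that the F1 score (≈0.842 here) sits between precision and recall but closer to the smaller of the two, which is what makes it sensitive to either quantity collapsing.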

2. Regression Metrics

Used for models predicting continuous values.

  • Mean Absolute Error (MAE): Average absolute difference between predicted and actual values.
   MAE = (1/n) Σᵢ |yᵢ − ŷᵢ|
  • Mean Squared Error (MSE): Average squared difference between predicted and actual values; penalizes larger errors more heavily than MAE.
   MSE = (1/n) Σᵢ (yᵢ − ŷᵢ)²
  • Root Mean Squared Error (RMSE): Square root of MSE, expressed in the same units as the target variable.
   RMSE = √MSE
  • R-squared (R²): Proportion of variance explained by the model.
   R² = 1 − Σᵢ (yᵢ − ŷᵢ)² / Σᵢ (yᵢ − ȳ)²
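The four regression formulas above can be sketched in a few lines of pure Python. The actual and predicted values are hypothetical, chosen only to illustrate the computations:

```python
import math

# Hypothetical actual and predicted values for illustration
y_true = [3.0, 5.0, 2.5, 7.0]
y_pred = [2.5, 5.0, 3.0, 8.0]

n = len(y_true)
errors = [yt - yp for yt, yp in zip(y_true, y_pred)]

mae = sum(abs(e) for e in errors) / n                 # mean absolute error
mse = sum(e * e for e in errors) / n                  # mean squared error
rmse = math.sqrt(mse)                                 # same units as y

y_mean = sum(y_true) / n
ss_res = sum(e * e for e in errors)                   # residual sum of squares
ss_tot = sum((yt - y_mean) ** 2 for yt in y_true)     # total sum of squares
r2 = 1 - ss_res / ss_tot                              # variance explained

print(f"MAE={mae:.3f} MSE={mse:.3f} RMSE={rmse:.3f} R2={r2:.3f}")
```

For these values MAE = 0.5 and MSE = 0.375: the single error of 1.0 dominates MSE more than MAE, which is exactly the "penalizes larger errors" behavior described above.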

Choosing the Right Metric

  • Imbalanced Classification: Use Precision, Recall, F1 Score, or AUPRC instead of accuracy.
  • Cost-Sensitive Tasks: Consider metrics that weigh errors differently.
  • Regression: Use MAE or RMSE based on error tolerance.
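Why accuracy misleads on imbalanced classification can be shown with a small sketch. The class split and the degenerate "always negative" model below are hypothetical, chosen to make the failure obvious:

```python
# Hypothetical imbalanced dataset: 95 negatives, 5 positives
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100  # degenerate model that always predicts the majority class

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
recall = tp / (tp + fn)

print(f"accuracy={accuracy:.2f} recall={recall:.2f}")  # accuracy 0.95, recall 0.00
```

The model scores 95% accuracy while catching none of the positive class, which is why recall, F1, or AUPRC are the better choices here.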
