Model Evaluation

Confusion Matrix

  Actual\Predicted    Positive                 Negative
  Positive            True Positives (TP)      False Negatives (FN)
  Negative            False Positives (FP)     True Negatives (TN)
  • Accuracy = (TP + TN) / All
  • Error Rate = (FP + FN) / All
  • Sensitivity = TP / P
  • Specificity = TN / N
  • Precision = TP / (TP + FP)
  • Recall = TP / (TP + FN) = TP / P
  • F Measure / F Score = harmonic mean of precision and recall
  • F1-measure = F score weighting precision and recall equally: F1 = 2 × Precision × Recall / (Precision + Recall)
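A minimal Python sketch (not from the notes) that computes the metrics above from the four confusion-matrix counts; the example counts are made up for illustration:

```python
def confusion_metrics(tp, fn, fp, tn):
    """Compute the evaluation metrics above from confusion-matrix counts."""
    p = tp + fn          # actual positives
    n = fp + tn          # actual negatives
    total = p + n
    precision = tp / (tp + fp)
    recall = tp / p      # same as sensitivity
    return {
        "accuracy": (tp + tn) / total,
        "error_rate": (fp + fn) / total,
        "sensitivity": recall,
        "specificity": tn / n,
        "precision": precision,
        "recall": recall,
        # F1: harmonic mean of precision and recall
        "f1": 2 * precision * recall / (precision + recall),
    }

m = confusion_metrics(tp=40, fn=10, fp=5, tn=45)
# accuracy = 0.85, sensitivity = 0.8, specificity = 0.9
```

Note that accuracy + error rate always sum to 1, since every tuple is either classified correctly (TP or TN) or incorrectly (FP or FN).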

ROC Curve

  • ROC = Receiver Operating Characteristics
    • Shows how a classifier performs at different decision thresholds
    • Visualizes the tradeoff between the true positive rate and the false positive rate
  • Procedure
    • Rank the test tuples by predicted likelihood of being positive, in decreasing order
    • Horizontal axis is the False Positive Rate (FPR), vertical axis the True Positive Rate (TPR)
  • Interpretation
    • The area under the ROC curve (AUC) measures the accuracy of the model: 1.0 is a perfect ranking, 0.5 is random guessing
  • Similarly, we have precision-recall curve
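The ranking procedure above can be sketched in plain Python (an illustrative implementation, not from the notes): walk down the ranked tuples, moving up for each positive and right for each negative, then take the trapezoidal area:

```python
def roc_points(scores, labels):
    """Trace the ROC curve: rank tuples by score (decreasing), then
    step through them, accumulating (FPR, TPR) points."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    p = sum(labels)              # number of actual positives
    n = len(labels) - p          # number of actual negatives
    tp = fp = 0
    points = [(0.0, 0.0)]
    for i in order:
        if labels[i] == 1:
            tp += 1              # positive tuple: curve moves up
        else:
            fp += 1              # negative tuple: curve moves right
        points.append((fp / n, tp / p))
    return points

def auc(points):
    """Area under the curve via the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area

# A perfect ranking (all positives scored above all negatives) gives AUC = 1.0
print(auc(roc_points([0.9, 0.8, 0.7, 0.6], [1, 1, 0, 0])))  # 1.0
```

A precision-recall curve is built the same way, plotting (recall, precision) at each step instead of (FPR, TPR).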

MAE and RMSE

  • Mean Absolute Error (MAE) = (1/n) Σ |y_i − ŷ_i|
  • Root Mean Squared Error (RMSE) = sqrt((1/n) Σ (y_i − ŷ_i)²)
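Both formulas translate directly into code; here is a short sketch with made-up example values:

```python
import math

def mae(y_true, y_pred):
    """Mean Absolute Error: average of |y_i - yhat_i|."""
    return sum(abs(a - b) for a, b in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root Mean Squared Error: sqrt of the average squared error."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))

print(mae([1, 2, 3], [2, 2, 5]))   # errors 1, 0, 2 -> MAE = 1.0
print(rmse([1, 2, 3], [2, 2, 5]))  # sqrt(5/3)
```

Because RMSE squares each error before averaging, it penalizes large errors more heavily than MAE, so RMSE ≥ MAE always holds.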

Kendall’s Tau

  • tau = (# concordant pairs − # discordant pairs) / total number of pairs
  • A pair (one positive tuple, one negative tuple) is concordant when the positive tuple is ranked above the negative one by prediction score
  • Total number of pairs is P × N, the number of positive tuples times the number of negative tuples
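Under the definition above (pairs are positive-negative tuples, concordant when the positive scores higher), tau can be sketched as follows; the function name and example scores are illustrative:

```python
def kendall_tau(scores, labels):
    """Kendall's tau over (positive, negative) pairs:
    (concordant - discordant) / total pairs."""
    pos = [s for s, lab in zip(scores, labels) if lab == 1]
    neg = [s for s, lab in zip(scores, labels) if lab == 0]
    # Concordant: positive tuple scored above the negative one
    concordant = sum(1 for p in pos for n in neg if p > n)
    discordant = sum(1 for p in pos for n in neg if p < n)
    total = len(pos) * len(neg)  # P x N pairs in total
    return (concordant - discordant) / total

# Perfect ranking: every positive above every negative -> tau = 1.0
print(kendall_tau([0.9, 0.8, 0.7, 0.6], [1, 1, 0, 0]))  # 1.0
```

tau ranges from −1 (every positive ranked below every negative) to +1 (every positive ranked above every negative), with 0 meaning the ranking carries no information about the labels.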