"GOOD" AND "BAD" k-OCCURRENCES
When labels are present, k-occurrences can be distinguished based on whether labels of neighbors match.
- : We define the number of "bad" k-occurrences of , as the number of points from data set for which is among the first k-NNs, and the labels of and the points in question do not match.
- : Conversely, , the number of "good" k-occurrences of , is the number of such points where labels do match.
- Naturally, for every .