"GOOD" AND "BAD" k-OCCURRENCES

When labels are present, k-occurrences can be distinguished based on whether labels of neighbors match.

  • : We define the number of "bad" k-occurrences of , as the number of points from data set for which is among the first k-NNs, and the labels of and the points in question do not match.
  • : Conversely, , the number of "good" k-occurrences of , is the number of such points where labels do match.
  • Naturally, for every .