(Master of Information Science) Machine Learning

  • Anomaly Detection
    • k-th NN (k-th nearest neighbor)
      • First, measure the distance between your point and all other points.
      • Sort the distances in ascending order and use the distance to the k-th nearest point as the anomaly score (a larger score means more anomalous).
      • Why k-th?
        • We want to find outliers.
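        • If k=1, outliers that lie close to each other are each other's nearest neighbor, so they get a small score and we may miss them.
        • So we use k=2, 3, or higher: the k-th neighbor of such an outlier is then a distant normal point, and the score becomes large.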
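
The steps above can be sketched in a few lines of Python. This is only a minimal brute-force illustration, not a reference implementation: the function name `kth_nn_scores`, the Euclidean distance, and the toy data are all assumptions made for the example.

```python
import math


def kth_nn_scores(points, k):
    """Return, for each point, the distance to its k-th nearest other point."""
    if not 1 <= k < len(points):
        raise ValueError("k must be between 1 and len(points) - 1")
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point (the point itself is excluded).
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        # After sorting, index k - 1 holds the distance to the k-th nearest point.
        scores.append(dists[k - 1])
    return scores


# Hypothetical toy data: a tight cluster near the origin,
# plus two outliers that sit close to each other.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0), (5.1, 5.0)]

scores_k1 = kth_nn_scores(points, k=1)
scores_k2 = kth_nn_scores(points, k=2)

print("k=1:", [round(s, 2) for s in scores_k1])
print("k=2:", [round(s, 2) for s in scores_k2])
```

With k=1 the two outliers score about the same as the cluster points, because each is the other's nearest neighbor. With k=2 their scores jump to the distance back to the cluster, so they stand out.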