Clustering

Machine Learning

Unsupervised Learning

January 16, 2021

Discuss the differences between K-Means and Hierarchical clustering.

The main differences between K-Means and the Hierarchical Clustering Algorithm (HCA) are as follows:

  1. K-Means requires the number of clusters (K) to be specified in advance, whereas hierarchical clustering has no such requirement. Hierarchical clustering builds the full tree of merges and displays it in the form of a dendrogram, from which a suitable number of clusters can be chosen by cutting the tree at a given height.
  2. K-Means typically performs better than HCA on data whose clusters are roughly spherical.
  3. Agglomerative hierarchical clustering, the most common form, starts with every point as its own cluster and keeps merging until one giant cluster remains (divisive variants work top-down instead). K-Means keeps the same number of clusters, K, in all its iterations.
  4. K-Means works best when clusters are roughly spherical (circular in two dimensions), while hierarchical clustering has no such requirement.
  5. K-Means represents each cluster by its centroid, computed as the mean of its points (the K-Medians variant uses the median instead), while HCA offers various linkage methods that may or may not employ the centroid.
  6. With the introduction of mini-batches, K-Means can work with very large datasets, whereas HCA scales poorly because standard agglomerative algorithms need the pairwise distances between all points.
  7. Hierarchical methods are suited to cases that require arranging the clusters into a natural hierarchy. In K-Means all clusters are on the same level, with no nesting among them, and they tend to have comparable WCSS (within-cluster sum of squares), i.e. cohesiveness.
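  8. HCA produces reproducible results because it is deterministic, while K-Means depends on random centroid initialization and can give different results from run to run unless the random seed is fixed.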
  9. K-Means simply divides the data into mutually exclusive subsets, while HCA arranges it into a tree.
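
Several of the points above can be seen in a small experiment. The following is a minimal sketch, assuming scikit-learn and SciPy are installed; the synthetic blob data, the choice of Ward linkage, and all variable names are illustrative assumptions and are not part of the original answer.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Illustrative data (an assumption): three well-separated, roughly spherical blobs.
X, _ = make_blobs(
    n_samples=150,
    centers=[[0, 0], [10, 10], [-10, 10]],
    cluster_std=1.0,
    random_state=0,
)

# K-Means: K must be given up front, and the result is a flat partition.
# Fixing random_state makes the random initialization repeatable.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
kmeans_labels = kmeans.labels_

# Hierarchical clustering (agglomerative, Ward linkage): build the merge tree once.
Z = linkage(X, method="ward")

# The same tree can then be cut at different levels without re-running anything.
hier_labels_3 = fcluster(Z, t=3, criterion="maxclust")
hier_labels_2 = fcluster(Z, t=2, criterion="maxclust")

# Rebuilding the tree on the same data gives the same result (no random step).
same_tree = np.array_equal(Z, linkage(X, method="ward"))

print("K-Means clusters:", len(set(kmeans_labels)))
print("Hierarchical clusters (cut at 3):", len(set(hier_labels_3)))
print("Hierarchical clusters (cut at 2):", len(set(hier_labels_2)))
print("Tree identical on rebuild:", same_tree)
```

Here `Z` is the merge tree that a dendrogram visualizes; `scipy.cluster.hierarchy.dendrogram(Z)` can draw it when matplotlib is available. For the large-dataset case in point 6, scikit-learn also provides `MiniBatchKMeans`, which takes the same `n_clusters` argument.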

by : Monis Khan
