Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Clustering Algorithms Using Different Distance Measures-A Comparison


Affiliations
1 Department of Statistics, Manonmaniam Sundaranar University, Tirunelveli-627012, India
2 Department of Statistics, Manonmaniam Sundaranar University, Tirunelveli, India
     

   Subscribe/Renew Journal


Data mining is the process of discovering meaningful correlations, trend and interesting patterns from a large volume of data. Clustering is the process of grouping similar data elements together. In this paper k-means algorithm and k-medoid algorithm is used along with distance measures like Euclidean, Manhattan and Squared on areal time medical data set a to group of similar patients based on their vision ailments. The Results are compared numerically and graphically to find the best distance measure. Experimental results shows that k-medoids clustering algorithm outperforms k-means clustering. The experiment was repeated using different distance measures like Euclidean, Manhatten and Squared. The results shows that k-medoid with Euclidean distance measure forms the most densed cluster and thus it is very effective than other distance measures.

Keywords

Data Mining, Clustering, K-Means K-Medoids and Distance Measure.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 180

PDF Views: 2




  • Clustering Algorithms Using Different Distance Measures-A Comparison

Abstract Views: 180  |  PDF Views: 2

Authors

Elaiyaperumal Sakthivel
Department of Statistics, Manonmaniam Sundaranar University, Tirunelveli-627012, India
Kaliaperumal Senthamarai Kannan
Department of Statistics, Manonmaniam Sundaranar University, Tirunelveli, India

Abstract


Data mining is the process of discovering meaningful correlations, trend and interesting patterns from a large volume of data. Clustering is the process of grouping similar data elements together. In this paper k-means algorithm and k-medoid algorithm is used along with distance measures like Euclidean, Manhattan and Squared on areal time medical data set a to group of similar patients based on their vision ailments. The Results are compared numerically and graphically to find the best distance measure. Experimental results shows that k-medoids clustering algorithm outperforms k-means clustering. The experiment was repeated using different distance measures like Euclidean, Manhatten and Squared. The results shows that k-medoid with Euclidean distance measure forms the most densed cluster and thus it is very effective than other distance measures.

Keywords


Data Mining, Clustering, K-Means K-Medoids and Distance Measure.