Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Efficient K-Means Algorithm for Data Clustering Using Calinski Indexing


Affiliations
1 Dr. SNS Rajalakshmi College of Arts and Science, India
2 Dept. of Computer Technology, Dr. SNS Rajalakshmi College of Arts and Science, India
     

   Subscribe/Renew Journal


In Data mining clustering is one of the important tools. Several research areas clustering is to be used and it describe the method for grouping the data. Describes the K-Means clustering algorithm and it has used the best validity index (Calinski index) for the attribute selection, having the value of the validity index as fitness function. Calinski index is to find the best number of clusters for the whole data set. The method is to study the maximum value maxk of ik (where k is the number of clusters and ik is the Calinski index value for k clusters). Number of cluster can be calculated by using Calinski index values along with NMF. The rand index value and the accuracy for the Calinski value is obtained, which proves that rand index value and accuracy is better than the existing clusters.

Keywords

K-Means Clustering, Calinski Index Value, Rand Index Value, WBC Dataset.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 160

PDF Views: 2




  • Efficient K-Means Algorithm for Data Clustering Using Calinski Indexing

Abstract Views: 160  |  PDF Views: 2

Authors

K. Sheela
Dr. SNS Rajalakshmi College of Arts and Science, India
N. Kamalraj
Dept. of Computer Technology, Dr. SNS Rajalakshmi College of Arts and Science, India

Abstract


In Data mining clustering is one of the important tools. Several research areas clustering is to be used and it describe the method for grouping the data. Describes the K-Means clustering algorithm and it has used the best validity index (Calinski index) for the attribute selection, having the value of the validity index as fitness function. Calinski index is to find the best number of clusters for the whole data set. The method is to study the maximum value maxk of ik (where k is the number of clusters and ik is the Calinski index value for k clusters). Number of cluster can be calculated by using Calinski index values along with NMF. The rand index value and the accuracy for the Calinski value is obtained, which proves that rand index value and accuracy is better than the existing clusters.

Keywords


K-Means Clustering, Calinski Index Value, Rand Index Value, WBC Dataset.