Open Access Open Access  Restricted Access Subscription Access

A Survey on K-Means Clustering in Various Domains


Affiliations
1 Department of Computer Science and Engineering, Mody Univesity of Science and Technology, Lakshmangarh, Sikar, Rajasthan, India
 

   Subscribe/Renew Journal


Data mining is used to extract the hidden patterns from large datasets and extracted patterns are helpful to identify knowledge about data to users. As various approaches are there for data mining named Clustering, Classification, Association rule mining, etc. Amongst all we consider clustering, which is an unsupervised learning and grouping. This paper demonstrates clustering technique named k-means clustering and its various improvements in different domains exterminate the limitations of traditional k-means clustering. K-means clustering is the simple partitioning clustering algorithm and exhibit many limitations, so it is very important to understand various enhancements for constructing hybrid algorithms to improve accuracy of algorithms. Various areas are defined where k-means clustering is widely used nowadays such as in healthcare, improving academic performance and optimization of search engine and much more.

Keywords

Academics, Clustering, Data Mining, Healthcare, K-Means, Search Engine.
User
Subscription Login to verify subscription
Notifications
Font Size

  • J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 2012.
  • C. C. Aggarwal, Data Mining: The Text Book, 1st ed., New York, Springer, 2015.
  • M. Zaki, and W. Meira, Data Mining and Analysis, New York, Cambridge University Press, 2014.
  • S. Shukla, and S. Naganna, “A review on k-means data clustering approach,” International Journal of Information & Computer Technology, vol. 4, no. 17, pp. 1847-1860, 2014.
  • J. Qiao, and Y. Zhang, “Study on k-means method based on data-mining,” in 2015 Chinese Automation Congress (CAC), pp. 51-54, 2015.
  • A. Yadav, and S. Dhingra, “A review on k-means clustering technique,” International Journal of Latest Research in Science and Technology, vol. 5, no. 4, pp. 13-16, 2016.
  • K. O. Khorsheed, M. M. Madbouly, and S. K. Guirguis, “Search engine optimization using data mining approach,” International Journal of Computer Engineering and Applications, vol. 9, no. 6, part 1, pp. 184-200, 2015.
  • M. E. Ahmed, and P. Bansal, “Clustering technique on search engine dataset using data mining tool,” 2013 Third International Conference on Advanced Computing and Communication Technologies (ACCT), pp. 86-89, 2013.
  • H. I. Arumawadu, R. M. K. T. Rathnayaka, and S. K. Illangarathne, “K-means clustering for segment web search results,” International Journal of Engineering Works, vol. 2, no. 8, pp. 79-83, 2015.
  • O. J. Oyelade, O. O. Oladipupo, and I. C. Obagbuwa, “Application of k means clustering algorithm for prediction of students academic performance,” International Journal of Computer Science and Information Security, vol. 7, no. 1, pp. 292-295, 2010.
  • A. M. de Morais, J. M. F. R. Araujo, and E. B. Costa, “Monitoring student performance using data clustering and predictive modelling,” 2014 IEEE Frontiers in Education Conference (FIE) Proceedings, pp. 1-8, 2014.
  • I. Singh, A. S. Sabitha, and A. Bansal, “Student performance analysis using clustering algorithm,” 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), pp. 294-299, 2016.
  • Md. H. I. Shovon, and M. Haque, “Prediction of student academic performance by an application of data mining techniques,” International Journal of Advanced Research in Computer Scienece and Software Engineering, vol. 2, no. 7, pp. 353-355, July 2012.
  • R. A. Haraty, M. Dimishkieh, and M. Masud, “An enhanced k-means clustering algorithm for pattern discovery in healthcare data,” International Journal of Distributed Sensor Networks, vol. 5, 2015.
  • V. Rogeith, and S. Magesh, “A survey on health care data using data mining techniques,” International Journal of Pure and Applied Mathematics, vol. 117, no. 16, pp. 665-672, 2017.
  • T. Meyyappan, and S. Ganga, “Performance of students evaluation in education sector using clustering k-means algorithms,” International Journal of Computer Science and Mobile Computing, vol. 3, no. 7, pp. 579-584, 2014.
  • A. Altayeva, S. Zharas, and Y. I. Cho, “Medical decision making diagnosis system integrating k-means and Naïve Bayes algorithms,” 2016 16th International Conference on Control, Automation and Systems (ICCAS), pp. 1087-1092, 2016.
  • A. Alsayat, and H. El-Sayed, “Efficient genetic k-means clustering for health care knowledge discovery,” 2016 IEEE 14th International Conference on Software Engineering Research, Management and Applications (SERA), IEEE, 2016.
  • A. Malarvizhi, and S. Ravichandran, “Data mining’s role in mining medical datasets for disease assessments - A case study,” International Journal of Pure and Applied Mathematics, vol. 119, no. 12, pp. 16255-16260, 2018.

Abstract Views: 242

PDF Views: 94




  • A Survey on K-Means Clustering in Various Domains

Abstract Views: 242  |  PDF Views: 94

Authors

Shivangi Singla
Department of Computer Science and Engineering, Mody Univesity of Science and Technology, Lakshmangarh, Sikar, Rajasthan, India
Pinaki Ghosh
Department of Computer Science and Engineering, Mody Univesity of Science and Technology, Lakshmangarh, Sikar, Rajasthan, India

Abstract


Data mining is used to extract the hidden patterns from large datasets and extracted patterns are helpful to identify knowledge about data to users. As various approaches are there for data mining named Clustering, Classification, Association rule mining, etc. Amongst all we consider clustering, which is an unsupervised learning and grouping. This paper demonstrates clustering technique named k-means clustering and its various improvements in different domains exterminate the limitations of traditional k-means clustering. K-means clustering is the simple partitioning clustering algorithm and exhibit many limitations, so it is very important to understand various enhancements for constructing hybrid algorithms to improve accuracy of algorithms. Various areas are defined where k-means clustering is widely used nowadays such as in healthcare, improving academic performance and optimization of search engine and much more.

Keywords


Academics, Clustering, Data Mining, Healthcare, K-Means, Search Engine.

References