Open Access Open Access  Restricted Access Subscription Access

An Arbitrary Gini Index for the Redundant Feature Datasets Analysis


Affiliations
1 Department of CSE, P V P Siddhartha Institute of Technology, Vijayawada − 520007, Andhra Pradesh, India
2 Department of CSE, ANU College, Guntur − 522510, Andhra Pradesh, India
3 Department of CSE, JKC College, Guntur − 522006, Andhra Pradesh, India
4 Department of CSE, SRK Institute of Technology, Vijayawada − 521108, Andhra Pradesh, India
 

Objectives: Knowledge Discovery methods get more accurate results when the dimensionality of the data is subsided; dimensionality is an important aspect of any data. Several algorithms have been proposed to increase the accuracy, but most of them generate complex models as the size of the data is extremely large. Objective of this paper is to build a simple model to get high accuracy. Method: In order to increase the accuracy of the Knowledge Discovery methods by substituting the dimensionality, we introduce a novel heuristic functionality, Arbitrary Gini Index (ArGI). Findings: We evaluated the performance of ArGI on the real world datasets. The experiment on the ten real world data sets analysis shows 60% data sets are more accurate for ArGI and 40% for Gini Index. Applications: It is expecting that the applications of ArGI will show a better approach in the real world learning tasks.

Keywords

Arbitrary Gini Index, CART, Classification, Datasets, Decision Tree, Filtering, Random Sampling.
User

Abstract Views: 230

PDF Views: 0




  • An Arbitrary Gini Index for the Redundant Feature Datasets Analysis

Abstract Views: 230  |  PDF Views: 0

Authors

Rajesh Vemulakonda
Department of CSE, P V P Siddhartha Institute of Technology, Vijayawada − 520007, Andhra Pradesh, India
Abdul Ahad
Department of CSE, ANU College, Guntur − 522510, Andhra Pradesh, India
Suresh Babu Yalavarthi
Department of CSE, JKC College, Guntur − 522006, Andhra Pradesh, India
Praneeth Cheraku
Department of CSE, SRK Institute of Technology, Vijayawada − 521108, Andhra Pradesh, India
Nageswara Rao Puli
Department of CSE, SRK Institute of Technology, Vijayawada − 521108, Andhra Pradesh, India

Abstract


Objectives: Knowledge Discovery methods get more accurate results when the dimensionality of the data is subsided; dimensionality is an important aspect of any data. Several algorithms have been proposed to increase the accuracy, but most of them generate complex models as the size of the data is extremely large. Objective of this paper is to build a simple model to get high accuracy. Method: In order to increase the accuracy of the Knowledge Discovery methods by substituting the dimensionality, we introduce a novel heuristic functionality, Arbitrary Gini Index (ArGI). Findings: We evaluated the performance of ArGI on the real world datasets. The experiment on the ten real world data sets analysis shows 60% data sets are more accurate for ArGI and 40% for Gini Index. Applications: It is expecting that the applications of ArGI will show a better approach in the real world learning tasks.

Keywords


Arbitrary Gini Index, CART, Classification, Datasets, Decision Tree, Filtering, Random Sampling.



DOI: https://doi.org/10.17485/ijst%2F2017%2Fv10i4%2F139347