Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

A Novel Algorithm for Association Rule Mining from Data with Incomplete and Missing Values


Affiliations
1 Department of Information Technology, Hindustan University, Tamil Nadu, India
     

   Subscribe/Renew Journal


Missing values and incomplete data are a natural phenomenon in real datasets. If the association rules mine incomplete disregard of missing values, mistaken rules are derived. In association rule mining, treatments of missing values and incomplete data are important. This paper proposes novel technique to mine association rule from data with missing values from large voluminous databases. The proposed technique is decomposed into two sub problems: database scrutinizes and rules mining phases. The first phase is used to reexamine transactions which are useful to mine frequent itemset. The second phase is to mine frequent itemset and construct association rules from valid database. This paper uses Apriori based algorithm in which proposed technique. The proposed technique is tested with synthetic and real T40I10D100K, Mushroom, Chess and Heart disease prediction datasets. Experimental results are shown that the proposed technique outperforms than robust association rule mining (RAR) and Association rules from data with Missing values by Database Partitioning and Merging (AMDPM) algorithm.

Keywords

Data Mining, Association Rule, Frequent Itemset, Missing Values, Incomplete Dataset.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 295

PDF Views: 0




  • A Novel Algorithm for Association Rule Mining from Data with Incomplete and Missing Values

Abstract Views: 295  |  PDF Views: 0

Authors

K. Rameshkumar
Department of Information Technology, Hindustan University, Tamil Nadu, India

Abstract


Missing values and incomplete data are a natural phenomenon in real datasets. If the association rules mine incomplete disregard of missing values, mistaken rules are derived. In association rule mining, treatments of missing values and incomplete data are important. This paper proposes novel technique to mine association rule from data with missing values from large voluminous databases. The proposed technique is decomposed into two sub problems: database scrutinizes and rules mining phases. The first phase is used to reexamine transactions which are useful to mine frequent itemset. The second phase is to mine frequent itemset and construct association rules from valid database. This paper uses Apriori based algorithm in which proposed technique. The proposed technique is tested with synthetic and real T40I10D100K, Mushroom, Chess and Heart disease prediction datasets. Experimental results are shown that the proposed technique outperforms than robust association rule mining (RAR) and Association rules from data with Missing values by Database Partitioning and Merging (AMDPM) algorithm.

Keywords


Data Mining, Association Rule, Frequent Itemset, Missing Values, Incomplete Dataset.