Open Access Open Access  Restricted Access Subscription Access

Privacy Preserving Data Mining for Ordinal Data using Correlation Based Transformation Strategy (CBTS)


Affiliations
1 Visvesvaraya Technological University, Belagavi - 590 018, Karnataka, India
2 University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India
3 BMS College of Engineering, Bangalore - 560019, Karnataka, India
 

Objectives: Preservation of privacy is a significant aspect of data mining. The main objective of PPDM is to hide or provide privacy to certain sensitive information so that they can be protected from unauthorized parties or intruders. Methods/ Statistical Analysis: Though privacy is achieved by hiding the sensitive or private data, it will affect the data mining algorithms in knowledge extraction, so an effective method or strategy is required to provide privacy to the data and simultaneously protecting the quality of data mining algorithms. Instead of removing or encrypting sensitive or private data, we make use of data transformation strategies that keep the statistical, semantic and heuristic nature of data while protecting the sensitive or private data. Findings: In this paper we studied the technical feasibility of realizing Privacy Preserving Data Mining. In the proposed work, Correlation Based Transformation Strategy for Privacy Preserving Data Mining is used for ordinal data. We apply the method on few datasets namely soybean, Breast Cancer, Nursery dataset and Car dataset. We tabulate the end results applying the proposed strategy on both the original and the transformed dataset and observe correlation difference, Information Entropy and Classification Accuracy with different machine learning algorithms and Clustering Quality. Application/Improvements: As an improvement, the proposed work can be extended by use of vector marking techniques where these techniques help in increasing the efficiency by avoiding unauthorised access to the information.

Keywords

Correlation Analysis, Nominal Data, Ordinal Data, Privacy Preserving Data Mining, Transformation Strategy.
User

Abstract Views: 170

PDF Views: 0




  • Privacy Preserving Data Mining for Ordinal Data using Correlation Based Transformation Strategy (CBTS)

Abstract Views: 170  |  PDF Views: 0

Authors

N. P. Nethravathi
Visvesvaraya Technological University, Belagavi - 590 018, Karnataka, India
Prasanth G. Rao
Visvesvaraya Technological University, Belagavi - 590 018, Karnataka, India
Chaitra C. Vaidya
University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India
S. Geethanjali
University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India
P. Madhura
University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India
K. Neha Nandan
University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India
P. Deepa Shenoy
University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India
M. Indiramma
BMS College of Engineering, Bangalore - 560019, Karnataka, India
K. R. Venugopal
University Visvesvaraya College of Engineering, Bangalore University, Bangalore - 560001, Karnataka, India

Abstract


Objectives: Preservation of privacy is a significant aspect of data mining. The main objective of PPDM is to hide or provide privacy to certain sensitive information so that they can be protected from unauthorized parties or intruders. Methods/ Statistical Analysis: Though privacy is achieved by hiding the sensitive or private data, it will affect the data mining algorithms in knowledge extraction, so an effective method or strategy is required to provide privacy to the data and simultaneously protecting the quality of data mining algorithms. Instead of removing or encrypting sensitive or private data, we make use of data transformation strategies that keep the statistical, semantic and heuristic nature of data while protecting the sensitive or private data. Findings: In this paper we studied the technical feasibility of realizing Privacy Preserving Data Mining. In the proposed work, Correlation Based Transformation Strategy for Privacy Preserving Data Mining is used for ordinal data. We apply the method on few datasets namely soybean, Breast Cancer, Nursery dataset and Car dataset. We tabulate the end results applying the proposed strategy on both the original and the transformed dataset and observe correlation difference, Information Entropy and Classification Accuracy with different machine learning algorithms and Clustering Quality. Application/Improvements: As an improvement, the proposed work can be extended by use of vector marking techniques where these techniques help in increasing the efficiency by avoiding unauthorised access to the information.

Keywords


Correlation Analysis, Nominal Data, Ordinal Data, Privacy Preserving Data Mining, Transformation Strategy.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i47%2F136040