Identifying Outliers in Datasets Using Outlier Removal Clustering (ORC) Algorithm

N. Nirmaladevi; R. Suresh Kumar

Identifying Outliers in Datasets Using Outlier Removal Clustering (ORC) Algorithm

N. Nirmaladevi , R. Suresh Kumar

Affiliations
1 Department of Computer Science, Sree Saraswathi Thyagaraja College, Thippampatti, Pollachi, India

The objective function of general K-Mean, this work associates a weight vector with each cluster to indicate which dimensions are relevant to the clusters. To prevent the value of the objective function from decreasing because of the elimination of dimensions, virtual dimensions are added to the objective function. The values of data points on virtual dimensions are set artificially to ensure that the objective function is minimized when the real subspace clusters or the clusters in original space are found. The outlier detection problem in some cases is similar to the classification problem. For example, the main concern of clustering-based outlier detection algorithms is to find clusters and outliers, which are often regarded as noise that should be removed in order to make more reliable clustering. This research work presents an algorithm that provides outlier detection and data clustering simultaneously. The algorithm improves the estimation of centroids of the generative distribution during the process of clustering and outlier discovery.

Keywords

Data Mining, Clustering, K-Means, High Dimensions, Outlier Removal Clustering (ORC) Algorithm.

I-Scholar

Journal Help

User

Subscription Login to verify subscription

Notifications

Journal Content
Browse

Font Size

Information

Abstract Views: 170

PDF Views: 3

Biometrics and Bioinformatics

Identifying Outliers in Datasets Using Outlier Removal Clustering (ORC) Algorithm

Subscribe/Renew Journal

Keywords

Identifying Outliers in Datasets Using Outlier Removal Clustering (ORC) Algorithm

Authors

Abstract

Keywords

Username
Password
Remember me

Username
Password
Remember me