Efficiency of K-Means Clustering Algorithm in Mining Outliers from Large Data Sets

Apply

Efficiency of K-Means Clustering Algorithm in Mining Outliers from Large Data Sets

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 9

Abstract

This paper presents the performance of k-means clustering algorithm, depending upon various mean values input methods. Clustering plays a vital role in data mining. Its main job is to group the similar data together based on the haracteristic they possess. The mean values are the centroids of the specified number of cluster groups. The centroids, though gets changed during the process of clustering, are alculated using several methods. Clustering algorithms can be applied for image analysis, pattern recognition, bio-informatics and in several other fields. The clustering algorithm consists to two stages with first stage forming the clusters-calculating centroid and the second stage determining the outliers. There are three methods for assigning the mean values in k-means clustering algorithm. The three mean value assignment methods are implemented, performance is analysed and comparison of every method is done. Outliers, the is advantage of the process are used in the analyzation to determine the performance with various mean inputs and methods.

Authors and Affiliations

Sridhar. A , Sowndarya. S

Keywords

EP ID EP97295
DOI -
Views 133
Downloads 0