Fast Efficient Clustering Algorithm for Balanced Data

Abstract

The Cluster analysis is a major technique for statistical analysis, machine learning, pattern recognition, data mining, image analysis and bioinformatics. K-means algorithm is one of the most important clustering algorithms. However, the k-means algorithm needs a large amount of computational time for handling large data sets. In this paper, we developed more efficient clustering algorithm to overcome this deficiency named Fast Balanced k-means (FBK-means). This algorithm is not only yields the best clustering results as in the k-means algorithm but also requires less computational time. The algorithm is working well in the case of balanced data.

Authors and Affiliations

Adel Sewisy, M. Marghny, Rasha ElAziz, Ahmed Taloba

Keywords

Related Articles

Implementation of Failure Enterprise Systems in Organizational Perspective Framework

Failure percentage of Enterprise Resource Planning (ERP) implementation projects stay high, even following quite a while of endeavours to diminish them. In this paper, the author proposes the exact exploration that plans...

Human Gesture Recognition using Keyframes on Local Joint Motion Trajectories

Human Action Recognition (HAR) systems are systems that recognize and classify the actions that users perform against the sensor or camera. In most HAR systems, an input test data is compared with the reference data in t...

Web Scraper Revealing Trends of Target Products and New Insights in Online Shopping Websites

Trillions of posts from Facebook, tweets in Twitter, photos on Instagram and e-mails on exchange servers are overwhelming the Internet with big data. This necessitates the development of such tools that can detect the fr...

Electronic Human Resource Management (e-HRM) of Hotel Business in Phuket

This research aims to study the pattern of the electronic human resources management (e-HRM) of the hotel business in Phuket. The study is conducted with the implementation of field data and in-depth interview of hotels’...

Multiple-Published Tables Privacy-Preserving Data Mining: A Survey for Multiple-Published Tables Techniques

With large growth in technology, reduced cost of storage media and networking enabled the organizations to collect very large volume of information from huge sources. Different data mining techniques are applied on such...

Download PDF file
  • EP ID EP147446
  • DOI 10.14569/IJACSA.2014.050619
  • Views 87
  • Downloads 0

How To Cite

Adel Sewisy, M. Marghny, Rasha ElAziz, Ahmed Taloba (2014). Fast Efficient Clustering Algorithm for Balanced Data. International Journal of Advanced Computer Science & Applications, 5(6), 123-129. https://europub.co.uk./articles/-A-147446