REVIEW OF CLUSTERING UNCERTAIN DATA

Abstract

 Clustering on uncertain data, one of the essential tasks in mining uncertain data, posts significant challenges on both modeling similarity between uncertain objects and developing efficient computational methods. The previous methods extend traditional partitioning clustering methods like k-means and density-based clustering methods like DBSCAN to uncertain data, thus rely on geometric distances between objects. Such methods cannot handle uncertain objects that are geometrically indistinguishable, such as products with the same mean but very different variances in customer ratings. Surprisingly, probability distributions, which are essential characteristics of uncertain objects, have not been considered in measuring similarity between uncertain objects. In this project, we systematically model uncertain objects in both continuous and discrete domains, where an uncertain object is modeled as a continuous and discrete random variable, respectively. We use the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrate it into partitioning and density-based clustering methods to cluster uncertain objects.

Authors and Affiliations

Ms. Nikhatparvin Ahamad*

Keywords

Related Articles

 JIT Manufacturing Systems in Indian Industries: A Review

 Most of people understand JIT as a system of reducing inventory and quality control but they do not realize that it is a system of highlighting the problems, and forcing the organization to find quick solutions. I...

 CHARACTERIZATION STUDY & TREATMENT OF MSW OF CROWDED RESIDENTIAL AREA OF AMRAVATI CITY, MS, INDIA

 The increasing population is worldwide problem which can be seen in Amravati city also. Due to rapid growth of population in Amravati municipal corporation area & changing life styles has resulted in increased...

 Quality of Groundwaters of the Rural District EL Ganzra (Province of Khemisset, Morocco)

 The study of groundwater quality in the rural commune of EL GANZRA by agricultural excellence is of importance for the use of groundwater for various activities (drinking, irrigation, patenting ...) to do a follow...

 Leaf Recognition Algorithm Using MLP Neural Network Based Image Processing

 In this paper, we employ Multilayer Perceptron with image and data processing techniques and neural network to implement a general purpose automated leaf recognition. Sampling leaves and photoing them are low cos...

GENDER AND LEVELS OF ATTAINMENT OF SCIENTIFIC LITERACY AMONG SCIENCE STUDENTS UNDER CONSTRUCTIVIST INSTRUCTIONAL MODEL

The study investigated levels of attainment of scientific literacy by junior secondary (8th grade) male and female students. Quasi-experiment of non- equivalent control group design was used. A total of 162 students were...

Download PDF file
  • EP ID EP90900
  • DOI 10.5281/zenodo.61474
  • Views 91
  • Downloads 0

How To Cite

Ms. Nikhatparvin Ahamad* (30).  REVIEW OF CLUSTERING UNCERTAIN DATA. International Journal of Engineering Sciences & Research Technology, 5(9), 119-121. https://europub.co.uk./articles/-A-90900