Empowering Document Clustering Through Multi View-Point Based Similarity Measure

Abstract

Clustering is one of the most important and long-established techniques in data mining, and it is an unsupervised learning paradigm. The similarity of a document pair can be measured by matching concepts, but extracting the most relevant concepts from documents is a challenging task. To address this issue, this paper introduces a multi-viewpoint based similarity measure. The proposed method uses multiple points of reference between document pairs to extract more relevant matching concepts, rather than extracting ideas based on a similarity measure alone. Using multiple viewpoints gathers more information about a particular topic from many different but relevant sources or concepts. This strategy works well with shorter documents and is especially effective with longer ones. By gathering more relevant concepts from the documents through multiple points of reference, document organization and retrieval can make better use of the documents held in storage, and retrieval of ideas and of related tasks or concepts becomes easier and faster. Experimental results show that the proposed method efficiently extracts more relevant concepts.
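
The abstract does not give the formula for the measure, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes documents are represented as equal-length term-weight vectors and that the similarity of a pair is the average, over a set of viewpoint documents, of the dot product of the two documents as seen from each viewpoint. The names `dot` and `multi_viewpoint_similarity` are hypothetical.

```python
from typing import Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def multi_viewpoint_similarity(
    d_i: Sequence[float],
    d_j: Sequence[float],
    viewpoints: Sequence[Sequence[float]],
) -> float:
    """Average similarity of d_i and d_j as seen from each viewpoint.

    Assumption (not taken from the abstract): from a viewpoint v, the
    two documents are compared via the dot product (d_i - v) . (d_j - v).
    """
    if not viewpoints:
        raise ValueError("at least one viewpoint is required")
    total = 0.0
    for v in viewpoints:
        # Re-express both documents relative to the current viewpoint.
        rel_i = [x - y for x, y in zip(d_i, v)]
        rel_j = [x - y for x, y in zip(d_j, v)]
        total += dot(rel_i, rel_j)
    return total / len(viewpoints)


if __name__ == "__main__":
    doc_a = [1.0, 0.0]
    doc_b = [1.0, 0.0]
    other_docs = [[0.0, 1.0], [0.0, 0.0]]  # hypothetical viewpoint documents
    print(multi_viewpoint_similarity(doc_a, doc_b, other_docs))
```

Under this assumption, a single viewpoint at the origin reduces the measure to the ordinary dot product, which is the single-reference-point case that the multi-viewpoint approach is meant to extend.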

Authors and Affiliations

M. John Basha and Dr. S. Srinivasan

How To Cite

M. John Basha and Dr. S. Srinivasan (2013). Empowering Document Clustering Through Multi View-Point Based Similarity Measure. International Journal of Research in Computer and Communication Technology, 2(8), -. https://europub.co.uk./articles/-A-27622