A Hybrid Approach for Measuring Semantic Similarity between Documents and its Application in Mining the Knowledge Repositories

Abstract

This paper explains about similarity measure and the relationship between the knowledge repositories. This paper also describes the significance of document similarity measures, algorithms and to which type of text it can be applied Document similarity measures are of full text similarity, paragraph similarity, sentence similarity, semantic similarity, structural similarity and statistical measures. Two different frameworks had been proposed in this paper, one for measuring document to document similarity and the other model which measures similarity between documents to multiple documents. These two proposed models can use any one of the similarity measures in implementation aspect, which is been put forth for further research.

Authors and Affiliations

K. Sumathy, Chidambaram

Keywords

Related Articles

An Enhanced Concept based Approach for User Centered Health Information Retrieval to Address Presentation Issues

The diversity of health information seekers signifies the enormous variety of information needs by numerous users. The existing health information retrieval systems failed to address the information needs of both medical...

Smartphones-Based Crowdsourcing Approach for Installing Indoor Wi-Fi Access Points

This study provides a new Crowdsourcing-based approach to identify the most crowded places in an indoor environment. The Crowdsourcing Indoor Localization system (CSI) has been one of the most used techniques in location...

A Cloud-Based Platform for Democratizing and Socializing the Benchmarking Process

Performances evaluation, benchmarking and re-producibility represent significant aspects for evaluating the practical impact of scientific research outcomes in the Computer Science field. In spite of all the benefits (e....

Assessing Trends of Existing Research Contribution Towards Internet-of-Things

With the growing demands of system automation, technology integration, and non-human intervention technique, Internet-of-Things (IoT) has evolved as a boon and value-added services over pervasive computing. IoT comprises...

Classifying three Communities of Assam Based on Anthropometric Characteristics using R Programming

The study of anthropometric characteristics of different communities plays an important role in design, ergonomics and architecture. As the change of life style, nutrition and ethnic composition of different communities...

Download PDF file
  • EP ID EP128610
  • DOI 10.14569/IJACSA.2016.070831
  • Views 123
  • Downloads 0

How To Cite

K. Sumathy, Chidambaram (2016). A Hybrid Approach for Measuring Semantic Similarity between Documents and its Application in Mining the Knowledge Repositories. International Journal of Advanced Computer Science & Applications, 7(8), 231-237. https://europub.co.uk./articles/-A-128610