Smart Cloud Document Clustering and plagiarism checker using TF-IDF Based on Cosine Similarity

Journal Title: GRD Journal for Engineering - Year 2017, Vol 2, Issue 5

Abstract

This research paper describes the results oriented from experimental study of conventional document clustering techniques implemented in the commercial spaces so far. Particularly, we compared main approaches related to document clustering, agglomerative hierarchical document clustering and K-means. Though this paper, we generates and implement checker’s algorithms which deals with the duplicacy of the document content with the rest of the documents in the cloud. We also generate algorithm required to deals with the classification of the cloud data. The classification in this algorithm is done on the basis of the date of data uploaded and. We will take the ratio of both vectors and generate a score which rates the document in the classification.

Authors and Affiliations

Sudhir Sahani, Rajat Goyal, Saurabh Sharma, Shaili Gupta

Keywords

Related Articles

Analysis of Life of Pressure Vessel

Pressure Vessels are storage tanks which were constructed to keep liquids, vapors, or gases at very high pressures, usually over 15 psig. Few Examples of general pressure storage tanks used in the petro refining and chem...

Optimizing Reservoir Capacity, Water Allocation and Crop Yield using Teaching Learning Based Optimization (TLBO) Technique

In the present study ‘Teaching Learning Based Optimization’ (TLBO) optimization method has been applied to the water resources engineering problem. TLBO is a population-based natural-inspired evolutionary algorithm compa...

Identification of Urban Void Spaces in an Area of Vadodara

Increasing population and urbanization demand more resources in the urban area in terms of hard and soft infrastructure both. Many problems related to spatial resources in the urban area are addressed which disputes to d...

Authorized Public Auditing of Dynamic Storage on Cloud with Efficient Verifiable Fine-Grained Updates with MHT

One of the top technology concept is Cloud Computing. Cloud storage servers plays an important role in the technology buzz – cloud computing where clients can store their data at cloud servers and can access this data fr...

Workplace Health and Safety Parameters and Their Relations: A Systematic Review

The objective of this paper to review some important case studies to figure out the relationship between the safety of an organization and the safety climate. Safety climate of an organization always demonstrate the leve...

Download PDF file
  • EP ID EP224420
  • DOI -
  • Views 107
  • Downloads 0

How To Cite

Sudhir Sahani, Rajat Goyal, Saurabh Sharma, Shaili Gupta (2017). Smart Cloud Document Clustering and plagiarism checker using TF-IDF Based on Cosine Similarity. GRD Journal for Engineering, 2(5), 331-333. https://europub.co.uk./articles/-A-224420