The Automated VSMs to Categorize Arabic Text Data Sets

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2014, Vol 13, Issue 1

Abstract

Text Categorization is one of the most important tasks in information retrieval and data mining. This paper aims at investigating different variations of vector space models (VSMs) using KNN algorithm. we used 242 Arabic abstract documents that were used by (Hmeidi & Kanaan, 1997). The bases of our comparison are the most popular text evaluation measures; we use Recall measure, Precision measure, and F1 measure. The Experimental results against the Saudi data sets reveal that Cosine outperformed over of the Dice and Jaccard coefficients.

Authors and Affiliations

Mamoun Suleiman Al Rababaa, Essam Said Hanandeh

Keywords

Related Articles

ON PROBABILTY DISTRIBUTION IN ASSOCIATION WITH A CERTAIN GENERALIZED HYPERGEOMETRIC FUNCTION

In the present paper, a probability function has been introduced in terms of the -function and its properties are studied. It is shown that the classical non-central distributions such as, non-central chi-square, non-cen...

RSA Algorithm achievement with Federal information processing Signature for Data protection in Cloud Computing

Cloud computing presents IT organizations with a funda­mentally different model of operation, one that takes advantage of the maturity of web applications and networks and the rising interoperability of computing system...

Security in Android

New technologies have always created new areas of concern for information security teams. Usually it provides time for the development of effective security controls. The rapid growth of the smartphone in market and the...

Comparative Analysis of Various Cloud Technologies

With the increasing prevalence and demand of large scale cloudcomputing environment, a researcher has to draw more attentiontowards the services provided by the CLOUD. As the access tothe server is increasing, centralize...

ENVIRONMENTALLY SUSTAINABLE INVENTORY MODEL UNDER PERMISSIBLE DELAY IN PAYMENTS

Within the economic order quantity (EOQ) framework, the main purpose of this paper is to investigate the supplier optimal replenishment policy of permissible delay in payments. All previously published articles dealing w...

Download PDF file
  • EP ID EP650494
  • DOI 10.24297/ijct.v13i1.2925
  • Views 88
  • Downloads 0

How To Cite

Mamoun Suleiman Al Rababaa, Essam Said Hanandeh (2014). The Automated VSMs to Categorize Arabic Text Data Sets. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 13(1), 4074-4081. https://europub.co.uk./articles/-A-650494