Arabic Lexicon Learning to Analyze Sentiment in Microblogs

Abstract

The study and classifying of opinions distilled from social media is called sentiment analysis. The goal of this study is to build an adaptive sentiment lexicon for Arabic language. Based on those lexicons the sentiments polarity classification can be improved. The classification problem will be stated as a mathematical programming problem. In this problem, we search a lexicon that optimizes the classification accuracy. A genetic algorithm is presented to solve the optimization problem. A meta-level feature is generated based on the adaptive lexicons provided by the genetic algorithm. The algorithm performance is supported by using it alongside n-gram features and Bing liu’s lexicon. In this work, lexicon-based and corpora-based approaches are integrated, and the lexicons are produced from the corpus. Five data sets are tested through experiments. The sentiments in all data sets are classified based on five polarity levels. A better understanding of words sentiment orientation, social media users’ culture and Arabic language can be achieved based on the lexicons generated by the proposed algorithm. Since stop words can contribute and add to the sentiment polarity, stop words will be considered and will not deleted. The results show that the F-measure is greater than 80 % in three data sets and the accuracy is greater than 80 % for all data sets. The proposed method out-performs the current methods in the literature in two of the datasets. Finally, in terms of F-measure, the proposed methods achieved better results for three datasets.

Authors and Affiliations

Mahmoud B. Rokaya, Ahmed S. Ghiduk

Keywords

Related Articles

Convolutional Neural Networks in Predicting Missing Text in Arabic

Missing text prediction is one of the major concerns of Natural Language Processing deep learning community’s at-tention. However, the majority of text prediction related research is performed in other languages but not...

A Compendious Study of Online Payment Systems: Past Developments, Present Impact, and Future Considerations

The advent of e-commerce together with the growth of the Internet promoted the digitisation of the payment process with the provision of various online payment methods like electronic cash, debit cards, credit cards, con...

Word-Based Grammars for PPM

The Prediction by Partial Matching (PPM) compression algorithm is considered one of the most efficient methods for compressing natural language text. Despite the advances of the PPM method for the English language to pre...

LPA Beamformer for Tracking Nonstationary Accelerated Near-Field Sources

In this paper, a computationally very efficient algorithm for direction of arrival (DOA) as well as range parameter estimation is proposed for near-field narrowband nonstationary accelerated moving sources. The proposed...

Spatial Comprehension Exercise System with 3D CG of Toy Model for Disabled Children

Spatial comprehension exercise system with Three-Dimensional Computer Graphics: 3D CG of toy model for disabled children is proposed. In order to improve spatial comprehension in an attractive manner, a toy model is crea...

Download PDF file
  • EP ID EP626857
  • DOI 10.14569/IJACSA.2019.0100878
  • Views 89
  • Downloads 0

How To Cite

Mahmoud B. Rokaya, Ahmed S. Ghiduk (2019). Arabic Lexicon Learning to Analyze Sentiment in Microblogs. International Journal of Advanced Computer Science & Applications, 10(8), 592-599. https://europub.co.uk./articles/-A-626857