A Zone Classification Approach for Arabic Documents using Hybrid Features
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2016, Vol 7, Issue 7
Abstract
Zone segmentation and classification is an important step in document layout analysis. It decomposes a given scanned document into zones. Zones need to be classified into text and non-text, so that only text zones are provided to a recognition engine. This eliminates garbage output resulting from sending non-text zones to the engine. This paper proposes a framework for zone segmentation and classification. Zones are segmented using morphological operation and connected component analysis. Features are then extracted from each zone for the purpose of classification into text and non-text. Features are hybrid between texture-based and connected component based features. Effective features are selected using genetic algorithm. Selected features are fed into a linear SVM classifier for zone classification. System evaluation shows that the proposed zone classification works well on multi-font and multi-size documents with a variety of layouts even on historical documents.
Authors and Affiliations
Amany M. Hesham, Sherif Abdou, Amr Badr, Mohsen Rashwan, Hassanin M. Al-Barhamtoshy
Enhanced and Improved Hybrid Model to Prediction of User Awareness in Agriculture Sector
Agriculture is the backbone of Indian economy and is the main income source for most of the population in India. So farmers are always curious about yield prediction. Crop yield depends on various factors like soil, weat...
A Study of Feature Selection Algorithms for Predicting Students Academic Performance
The main aim of all the educational organizations is to improve the quality of education and elevate the academic performance of students. Educational Data Mining (EDM) is a growing research field which helps academic in...
New Approach of Automatic Modulation Classification based on in Phase-Quadrature Diagram Combined with Artificial Neural Network
Automatic Modulation Classification (AMC) with intelligent system is an attracting area of research due to the development of SDR (Software Defined Radio). This paper proposes a new algorithm based on a combination of k-...
Power Management of a Stand-Alone Hybrid (Wind/Solar/Battery) Energy System: An Experimental Investigation
In this manuscript, a hybrid wind/solar/battery energy system is proposed for a stand-alone applications. Wind-solar energy sources are used as power generation source in the proposed hybrid energy system (HES), whereas...
New mechanism for Cloud Computing Storage Security
Cloud computing, often referred to as simply the cloud, appears as an emerging computing paradigm which promises to radically change the way computer applications and services are constructed, delivered, managed and fina...