A PROPOSED MODEL FOR EXTRACTING INFORMATION FROM ARABIC-BASED CONTROLLED TEXT DOMAINS, DISCUSSING THE INITIAL MODEL STEPS

Apply

A PROPOSED MODEL FOR EXTRACTING INFORMATION FROM ARABIC-BASED CONTROLLED TEXT DOMAINS, DISCUSSING THE INITIAL MODEL STEPS

Journal Title: International Journal of Applied and Natural Sciences - Year 2018, Vol 7, Issue 2

Abstract

Information extraction from Arabic as well as other languages text is commonly implemented over restricted text domains. Approaching open text domains is challenging, because of the syntactic, semantic and pragmatics ambiguities and variations in text. For the purpose of approaching more relaxed versions of Arabic text domains, Fasha et al. (Fasha et al. 2017) presented a high-level description fora proposed work methodology that can establish a model for extracting information from controlled text domains. In that work, controlled text domains were defined as the text domains that are not restricted in their linguistic features or their knowledge types yet they are not very unanticipated in these respects. In this paper, we discuss that work methodology and its implementation in more detail. Our discussion includes the initial phases of the methodology which covers the corpus preparation processes including its selection, analysis and annotation using a custom morpho-syntactic Part-of-Speech tagging scheme, we also discuss the designing of the supporting knowledge-base model which will be used to represent a

Authors and Affiliations

Mohammad Fasha, Nadim Obeid, Bassam Hammo

Keywords

Narabic Natural Language Processing POS Tagging Ontology Based Information Extraction Description

SURVEY OF MAJOR FOLIAR FUNGAL DISEASES OF QUERCUS SERRATA FROM VARIOUS PARTS OF MANIPUR

India is the second largest producer of silk next to China. All the 5 varieties of silk viz., Mulberry, Tropical Tasar, Muga, Eri and Oak tasar are produced in India. Oak tasar silk is produced in North Eastern and North...

Biodiversity of Macrofungi and Slime Molds from CHM Campus

Biodiversity is the degree of variation of life forms within a given ecosystem or biome. Biodiversity is not constant across the earth. The Western Ghats in India are rich reserves of biodiversity. The present paper desc...

Integrated Management of Root Rot of Soybean

Effectiveness of organic ammendments, beneficial microbes and chemical fungicides tested under natural filed conditions against root rot of soybean incited by Rhizoctonia bataticola. Results revealed that seed treatment...

Composition Effects of Al2o3 on Ftir And DTA in Lithium Borate Glasses

Lithium aluminum borate glasses of composition 35Li2O: (65-x) B2O3: xAl2O3 (where x = 0,5,10,15,20) were prepared by melting quenching technique and investigated by XRD, SEM, DTA and FTIR measurement. Differential Therma...

Remediation of Tomato (Lycopersicum esculentum) Fruit Rot Caused by Fusarium oxysporum f. sp. lycopersici Using Various Plant Extracts

The antifungal activity of five plant extracts viz., Allium sativum, Zingiber officinale, Allium cepa, Mentha spicata and Curcuma longa were evaluated against the tomato phytopathogenic fungi, Fusarium oxysporum f. sp. L...

EP ID EP275625
DOI -
Views 107
Downloads 0

How To Cite

Mohammad Fasha, Nadim Obeid, Bassam Hammo (2018). A PROPOSED MODEL FOR EXTRACTING INFORMATION FROM ARABIC-BASED CONTROLLED TEXT DOMAINS, DISCUSSING THE INITIAL MODEL STEPS. International Journal of Applied and Natural Sciences, 7(2), 65-86. https://europub.co.uk./articles/-A-275625