The First Parallel Multi-lingual Corpus of Amazigh

Abstract

Amazigh is the autochthonous language of North Africa. However, it was not until 2011, after years of persecution, that it became a constitutionally official language in Morocco. Amazigh is still considered an under-resourced language. This paper presents the development of a multi-lingual parallel corpus (Amazigh-English-French) aligned at the sentence level. The corpus is intended for use in linguistic research, teaching, and natural language processing applications, primarily machine translation. The paper discusses these intended uses and presents the corpus encoding. By bringing together Amazigh, English and French, the corpus offers the NLP community a new resource that complements the existing range of parallel corpora. To the best of our knowledge, it is the first Amazigh-English-French multi-lingual parallel corpus. The corpus comprises 31,864 aligned sentences. The alignment was performed automatically and evaluated manually. The evaluation results are satisfactory, exceeding 90%.

Authors and Affiliations

Fadoua ATAA ALLAH, Nina MIFTAH

Download PDF file
  • EP ID EP394437
  • DOI 10.9790/9622-0806050512.

How To Cite

Fadoua ATAA ALLAH, Nina MIFTAH (2018). The First Parallel Multi-lingual Corpus of Amazigh. International Journal of Engineering Research and Applications, 8(6), 5-12. https://europub.co.uk./articles/-A-394437