MODELING MORPHOLOGICAL ANALYSIS BASED ON WORD-ENDING FOR UZBEK LANGUAGE

Journal Title: International scientific journal Science and Innovation - Year 2023, Vol 2, Issue 11

Abstract

Uzbek is an agglutinative language: words are formed by attaching affixes to roots, and inflectional endings encode a range of morphological features. This property yields a large number of possible word-ending combinations, which greatly increases vocabulary size and causes data-sparseness problems for statistical models. This paper presents a morphological analysis model that performs stemming, lemmatization, and extraction of morphological information while accounting for morpho-phonetic exceptions. The core of the model is a complete set of word endings with assigned morphological information, together with additional datasets for morphological analysis. The model was evaluated on a curated test set of 5.3K words and achieved a word-level accuracy above 91%, as determined by linguistic experts who manually verified the stems, lemmas, and morphological features. A tool based on the proposed methodology is available as an open-source Python package and as a web-based application with a public API.
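
The word-ending idea described in the abstract can be illustrated with a minimal sketch. This is not the published package or its API: the table `WORD_ENDINGS`, the function `analyze`, the feature labels, and the `MIN_STEM_LEN` threshold are all hypothetical names chosen for illustration, and the table holds only a handful of common Uzbek endings (plural -lar, first-person possessive -im, locative -da). The sketch looks up the longest matching word ending in a table that maps each complete ending to its morphological information. It does not handle morpho-phonetic exceptions or lemmatization, which the paper's model addresses.

```python
# Toy table (an assumption, not the paper's dataset): each complete
# word ending maps directly to the morphological features it encodes.
WORD_ENDINGS = {
    "lar": {"number": "plural"},
    "im": {"possessive": "1sg"},
    "da": {"case": "locative"},
    "larda": {"number": "plural", "case": "locative"},
    "larimda": {"number": "plural", "possessive": "1sg", "case": "locative"},
}

# Hypothetical guard so that very short stems are never produced.
MIN_STEM_LEN = 2


def analyze(word):
    """Split a word into stem and ending by longest-ending lookup."""
    word = word.lower()
    # Moving the cut point left to right tries the longest ending first.
    for cut in range(MIN_STEM_LEN, len(word)):
        ending = word[cut:]
        if ending in WORD_ENDINGS:
            return {
                "stem": word[:cut],
                "ending": ending,
                "features": WORD_ENDINGS[ending],
            }
    # No known ending: treat the whole word as the stem.
    return {"stem": word, "ending": "", "features": {}}


if __name__ == "__main__":
    for w in ["kitoblarimda", "maktabda", "kitob"]:
        print(w, analyze(w))
```

Storing whole endings such as "larimda" as single table entries, rather than stripping one affix at a time, is what makes the size and completeness of the word-ending set central to this kind of model. A toy table like this one will over-strip any root that happens to end in a listed ending, which is one reason a real analyzer needs additional datasets and exception handling.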

Authors and Affiliations

Ulugbek Salaev

  • EP ID EP725418
  • DOI 10.5281/zenodo.10155225

How To Cite

Ulugbek Salaev (2023). MODELING MORPHOLOGICAL ANALYSIS BASED ON WORD-ENDING FOR UZBEK LANGUAGE. International scientific journal Science and Innovation, 2(11), -. https://europub.co.uk./articles/-A-725418