Dynamic Programming Method Applied in Vietnamese Word Segmentation Based on Mutual Information among Syllables

Abstract

 Vietnamese word segmentation is an important step in Vietnamese natural language processing such as text categorization, text summary, and automated machine translation. The problem with Vietnamese word segmentation is complicated because Vietnamese words are not always separated by a space. One word can include one or more syllables depending on the context. This paper proposes a method for Vietnamese word segmentation based on the mutual information among the syllables combined with dynamic programming. With this method, we can achieve an accuracy rate of about 90% with a raw text corpus.

Authors and Affiliations

Nguyen Uyen, Tran Sang

Keywords

Related Articles

 Human Lips-Contour Recognition and Tracing

 Human-lip detection is an important criterion for many automated modern system in present day. Like computerized speech reading, face recognition etc. system can work more precisely if human-lip can detect accurate...

 A More Intelligent Literature Search

 Although the topic of study relates to an environmental/health issue, it is the methodology described which serves to showcase an embryonic form of a new “more intelligent” protocol of search algorithm. Through the...

 Method for 3D Image Representation with Reducing the Number of Frames based on Characteristics of Human Eyes

 Method for 3D image representation with reducing the number of frames based on characteristics of human eyes is proposed together with representation of 3D depth by changing the pixel transparency. Through experime...

 Factor Analysis Based Selections

 Merger in higher education has been of scholarly interest to researchers in various fields. This work is devoted to challenges related to partner selection for an feasible merger. A systematic approach is proposed...

An interactive Tool for Writer Identification based on Offline Text Dependent Approach

Writer identification is the process of identifying the writer of the document based on their handwriting. The growth of computational engineering, artificial intelligence and pattern recognition fields owes greatly to o...

Download PDF file
  • EP ID EP147576
  • DOI 10.14569/IJARAI.2014.030904
  • Views 135
  • Downloads 0

How To Cite

Nguyen Uyen, Tran Sang (2014).  Dynamic Programming Method Applied in Vietnamese Word Segmentation Based on Mutual Information among Syllables. International Journal of Advanced Research in Artificial Intelligence(IJARAI), 3(9), 24-27. https://europub.co.uk./articles/-A-147576