New Feature Selection Method of Uyghur Text Classification

Journal Title: 河南科技大学学报(自然科学版) - Year 2016, Vol 37, Issue 3

Abstract

In order to deal with the insufficient consideration of the traditional chi-square statistic method in thefrequency and category distribution of feature items,a new Chi-square statistic feature selection method combined with the cosine similarity was proposed. Firstly,the mean term frequency-inverse document frequency( TF-IDF) was used to represent the features,and the selected feature items was balanced by introducing a adjustment formula. Thus the traditional chi-square statistic method was modified. Then the noise text was eliminated further by cosine similarity. Finally,a demonstration experiment was established on the collected Uyghur data set. The results show that the improved chi-square test method has better robustness. The classification performance is superior to the traditional chi-square statistic method.

Authors and Affiliations

Yan HE, HALIDAN• Abudureyimu, ALIYA• Aierken, Bingbing WU

Keywords

Related Articles

Design of Embedded Intelligent Vision System

In view of the high complexity of image recognition algorithm and poor robustness of present embedded vision system, the embedded intelligent vision system was proposed. The advanced RISC machine ( ARM) Cortex-A53 ( Allw...

Hopf Bifurcation Analysis of Time Lag Lü System

In order to further improve the actual fitting degree of nonlinear Lü system and reduce the uncertain conditions of the system, the Hopf bifurcation theory was combined with the time lag factor existing in the system to...

Design and Measurement of Near-Field-Focused Planar Arrays

In order to develop the focusing antenna used in the field of microwave hyperthermia, based on the principle of electromagnetic focusing, 4 × 4 near field focused planar microstrip array antenna with an operating frequen...

Construction Disturbance Analysis of Existing Bridge Piles Traversed by Interval Subsurface Excavation Tunnel

The construction disturbance of existing bridge piles traversed by interval subsurface excavation tunnel was analyzed by 3D numerical analysis and site measurement. The results show that the spatial effect of existing br...

Download PDF file
  • EP ID EP461277
  • DOI 10.15926/j.cnki.issn1672-6871.2016.03.010
  • Views 73
  • Downloads 0

How To Cite

Yan HE, HALIDAN• Abudureyimu, ALIYA• Aierken, Bingbing WU (2016). New Feature Selection Method of Uyghur Text Classification. 河南科技大学学报(自然科学版), 37(3), 42-46. https://europub.co.uk./articles/-A-461277