Research of Imbalanced Data Classification in Data Mining

Journal Title: Scholars Journal of Physics, Mathematics and Statistics - Year 2016, Vol 3, Issue 3

Abstract

Classification is one of the most important research contents in data mining and traditional classification methods are relatively mature, when dealing with well-balanced data they can make good performances. But in real world the data is usually imbalanced, that is, most of the data are in majority class and little data are in minority class. Imbalanced data set cause the deduction of the precision of the minority class samples, when it is classified by traditional algorithm, which can tend to favor the more class samples. Making researches on imbalanced datasets are quite important. In order to help readers to have a clear idea of the currently proposed and future work data classification, in view of imbalanced data progress, this paper introduced three developed methods: data level, algorithmic level and developed methods that were the performance evaluation of imbalanced data classification. We are very glad to receive the valuable reference provided by the academics that interested in this field.

Authors and Affiliations

Xin Hua, ZhouShao Hua, Hu Jin Yan

Keywords

Related Articles

A Specific Formula to Compute the Determinant of One Matrix of Order

Let be an matrix, where , In this paper, we establish a specific formula to calculate the determinant of matrix .

Difficulties in license at the Faculty of Sciences

Students in the license (3th academic year) complain if they fail, at the end of academic year, one, two or three courses specifically because, in this case, they must register the year following that for one, two or thr...

Rationalized Haar collocation method for solving singular nonlinear Lane-Emden type equations

In this study we have proposed the Rationalized Haar (RH) collocation method for the solution of Lane-Emden equations arising in astrophysics as singular initial value problems. In order to test the applicability, accura...

A Note on the Application of Wazewski’s Topological Method to an Integro- Differential Equation of Volterra Type

The purpose of this note is to generalize the Wazewski’s Topological Method 11, originally stated for ordinary differential equations, to the integro – differential equation of Volterra type (1), under suitable conditi...

The Stability of the Triangular Points of the Restricted three Body Problem when both the Primaries are Triaxial Rigid Bodies

The location and the stability of the triangular points of the planar restricted three body problem have been discussed when both the primaries are triaxial rigid bodies considering the case of stationary rotational mot...

Download PDF file
  • EP ID EP385468
  • DOI -
  • Views 83
  • Downloads 0

How To Cite

Xin Hua, ZhouShao Hua, Hu Jin Yan (2016). Research of Imbalanced Data Classification in Data Mining. Scholars Journal of Physics, Mathematics and Statistics, 3(3), 117-122. https://europub.co.uk./articles/-A-385468