Stochastic Gradient Descent with SVM for Imbalanced Data Classification
Journal Title: Scholars Journal of Physics, Mathematics and Statistics - Year 2016, Vol 3, Issue 4
Abstract
Stochastic Gradient Descent (SGD) is an attractive choice for SVM training. SGD leads to a result that the probability of choosing majority class is far greater than that of minority class for imbalanced classification problem. In order to deal with the large-scale imbalanced data classification problems, a method named stochastic gradient descent algorithm with SVM for imbalanced data classification is proposed. First, to deal with imbalanced data classification problems, we define the weight according to the size of positive and negative dataset. Then, a fast learning algorithm on large datasets called the weighted stochastic gradient descent algorithm with SVM is proposed, which helps to reduce the hyperplane offset to the minority class, thus solve the large-scale imbalanced data classification problems. Experimental results on real datasets show that the proposed method is effective.
Authors and Affiliations
Lu Shuxia, Zhu Chenxu, Zhou Mi
Simulation for Performance Analysis of Some Capability Indices on Net-Volume Content of 35cl Coca-Cola Soft Drinks
This paper examined the statistical performance of some capability indices such as Cp and Cpk using simulation. Statistical software R version 3.1.3 was used in generating data sets on net-volume contents of 35cl coca-co...
The Research of Remaining Oil in the Third Southern District by using Numerical Simulation
In order to determine the injection parameters of class â…¡reservoir, according to the laboratory physical simulation experiments, numerical simulation and analysis of field data, study the geological features, sedimentary...
Evaluation of the Development Level of Intelligent City Based on Analytic Hierarchy Process and Fuzzy Forest
The sustainable development of the city is based on the current development mode and rational planning. This paper studies the development of three different types of cities, first, we use a similar analytic approach to...
Transport Modeling at Macro Level: Some Results for Odisha
The central motive of this article is to study adopting multiple regression technique to reflect the effects of some socio-economic indicators on transport system. It is found that socio-economic indicators like agricult...
Logistic Regression Modeling to Isolate Factors that Correlate with Usage of ITN as a Prophylactic to Malaria in Ghana
The study was conducted to isolate factors that correlate with ownership and usage of insecticide treated nets (ITNs) as a prophylactic to malaria in Asamankese, Ghana and explore the policy implications of the findings...