Empirical Assessment of Ensemble based Approaches to Classify Imbalanced Data in Binary Classification

Abstract

Classifying imbalanced data with traditional classifiers is a huge challenge now-a-days. Imbalance data is a situation wherein the ratio of data within classes is not same. Many real life situations deal with such problems e.g. Web spam detection, Credit card frauds, and Fraudulent telephone calls. The problem exists everywhere when our objective is to identify exceptional cases. The problem is handled by researchers either by modifying the existing classifications methods or by developing new methods. This paper review ensemble based approaches (Boosting and Bagging based) designed to address imbalance in classes by focusing on binary classification. We compared 6 Boosting based, 7 Bagging based and 2 hybrid ensembles for their performance in imbalance domain. We use KEEL tool to evaluate the performance of these methods by implementing the methods on seven imbalance data having class imbalance ratio from 1.82 to as high as 129.44. Area Under the curve (AUC) parameter is recorded as the performance metric. We also statistically analyzed the methods using Friedman rank test and Wilcoxon Matched Pair signed rank test to strengthen the visual interpretations. After analysis, it is proved that RusBoost ensemble outperformed every other ensemble in the imbalanced data situations.

Authors and Affiliations

Prabhjot Kaur, Anjana Gosain

Keywords

Related Articles

Object’s Shape Recognition using Local Binary Patterns

This paper discusses the concept of object’s shape identification using local binary pattern technique (LBP). Since LBP is computationally simple it has been utilized successfully for recognition of various objects. LBP...

Deployment Protocol for Underwater Wireless Sensors Network based on Virtual Force

Recently, Underwater Sensor Networks (UWSNs) have attracted researchers’ attention due to the challenges and the peculiar characteristics of the underwater environment. The initial random deployment of UWSN where sensors...

The Impact and Challenges of Cloud Computing Adoption on Public Universities in Southwestern Nigeria

This study investigates the impact and challenges of the adoption of cloud computing by public universities in the Southwestern part of Nigeria. A sample size of 100 IT staff, 50 para-IT staff and 50 students were select...

Performance Analysis of Proposed Congestion Avoiding Protocol for IEEE 802.11s

The wireless technology is one of the core compo-nents of mobile applications with mobility support at low deploy-ment costs. Among these, Wireless Mesh Network (WMN) is one of the technologies that supports mobile users...

Scientific Articles Exploration System Model based in Immersive Virtual Reality and Natural Language Processing Techniques

After having carried out a historical review and identifying the state of the art in relation to the interfaces for the exploration of scientific articles, the authors propose a model based in an immersive virtual enviro...

Download PDF file
  • EP ID EP498374
  • DOI 10.14569/IJACSA.2019.0100307
  • Views 81
  • Downloads 0

How To Cite

Prabhjot Kaur, Anjana Gosain (2019). Empirical Assessment of Ensemble based Approaches to Classify Imbalanced Data in Binary Classification. International Journal of Advanced Computer Science & Applications, 10(3), 48-58. https://europub.co.uk./articles/-A-498374