Comparative Study of Three Imputation Methods to Treat Missing Values
Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2013, Vol 11, Issue 7
Abstract
One relevant problem in data preprocessing is the presence of missing data, which degrades the quality of the patterns extracted by mining. Imputation is a widely used procedure that replaces the missing values in a data set with plausible values. The advantage of this approach is that the missing-data treatment is independent of the learning algorithm used, which allows the user to select the most suitable imputation method for each situation. This paper analyzes the imputation methods proposed in the field of statistics with respect to data mining. A comparative analysis of three imputation approaches that can be used to impute missing attribute values in data mining is given, showing the most promising method. An artificial input data file (of numeric type) containing 1000 records is used to investigate the performance of these methods. A Z-test approach was used to test the significance of these methods.
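The workflow outlined in the abstract (an artificial numeric data set of 1000 records, imputation of missing values, then a Z-test) might be sketched as follows. The abstract does not name the three methods compared, so this minimal sketch assumes simple mean imputation as a stand-in, a normally distributed attribute with a known mean and standard deviation, and a 10% missing rate. The helper names `mean_impute` and `one_sample_z` are hypothetical and are not taken from the paper.

```python
import math
import random
import statistics


def mean_impute(values):
    """Replace None entries with the mean of the observed entries.

    Assumption: mean imputation is used purely as an illustration; the
    abstract does not say which three methods the paper compares.
    """
    observed = [v for v in values if v is not None]
    fill = statistics.fmean(observed)
    return [fill if v is None else v for v in values]


def one_sample_z(sample, mu0, sigma):
    """Z statistic for H0: population mean equals mu0, with sigma known."""
    n = len(sample)
    return (statistics.fmean(sample) - mu0) / (sigma / math.sqrt(n))


random.seed(0)  # fixed seed so the sketch is reproducible

# Assumed parameters of the artificial data (not from the paper).
MU, SIGMA, N = 50.0, 10.0, 1000
complete = [random.gauss(MU, SIGMA) for _ in range(N)]

# Knock out 10% of the records at random to simulate missing values.
missing_idx = set(random.sample(range(N), N // 10))
incomplete = [None if i in missing_idx else v for i, v in enumerate(complete)]

imputed = mean_impute(incomplete)

# Compare the mean of the imputed data against the known population mean.
z = one_sample_z(imputed, MU, SIGMA)
print(f"z = {z:.3f}; |z| > 1.96 would reject H0 at the 5% level")
```

Mean imputation shrinks the variance of the attribute, so plugging the known `SIGMA` into the test is a simplification made for this sketch. A comparison of several methods would repeat the last two steps once per imputation method.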
Authors and Affiliations
Rahul Singhai