Imputation And Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach In Dementia Diagnosis
Journal Title: International Journal of Advanced Research in Artificial Intelligence (IJARAI) - Year 2012, Vol 1, Issue 4
Abstract
This paper compares different data imputation approaches used to fill in missing data and proposes a combined approach for accurately estimating missing attribute values in a patient database. The study suggests a more robust technique that is likely to supply a value closer to the missing one, for effective classification and diagnosis. First, the data is clustered and the z-score method is used to select possible values for an instance with missing attribute values. Then a multiple imputation method using LSSVM (Least Squares Support Vector Machine) is applied to select the most appropriate values for the missing attributes. Five imputed datasets are used to demonstrate the performance of the proposed method. Experimental results show that the method outperforms conventional multiple imputation and mean substitution. Moreover, the proposed method, CZLSSVM (Clustered Z-score Least Square Support Vector Machine), has been evaluated on two classification problems with incomplete data. The efficacy of the imputation methods has been evaluated using an LSSVM classifier. Experimental results indicate that classification accuracy increases when CZLSSVM is used to estimate missing attribute values. CZLSSVM is found to outperform other data imputation approaches such as decision trees, rough sets, artificial neural networks, K-NN (K-Nearest Neighbour) and SVM. Further, CZLSSVM is observed to yield 95 per cent accuracy and better prediction capability than the other methods included and tested in the study.
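The z-score selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the threshold, the clustering algorithm, or the exact selection rule, so everything here is an assumption made for illustration: the function name `zscore_candidates`, the cut-off `z_max`, and the sample values are hypothetical. The sketch takes one attribute within one already-formed cluster and keeps the observed values whose z-score lies within the cut-off as candidate replacements for a missing entry. The subsequent LSSVM step that picks among the candidates is not shown.

```python
from statistics import mean, pstdev

def zscore_candidates(cluster_values, z_max=1.0):
    """Return observed values in a cluster whose |z-score| <= z_max.

    cluster_values: values of one attribute for the instances in one
    cluster, with None marking a missing entry (assumed encoding).
    z_max: hypothetical cut-off; the abstract does not specify one.
    """
    observed = [v for v in cluster_values if v is not None]
    if not observed:
        return []  # nothing to learn from in this cluster
    mu = mean(observed)
    sigma = pstdev(observed)  # population standard deviation
    if sigma == 0:
        return [observed[0]]  # all observed values are identical
    return [v for v in observed if abs((v - mu) / sigma) <= z_max]

# Hypothetical cluster: one attribute, one missing entry, one outlier.
cluster = [24.0, 26.0, None, 25.0, 40.0, 23.0]
candidates = zscore_candidates(cluster)
print(candidates)
```

In this made-up cluster the value 40.0 lies almost two standard deviations from the cluster mean, so it is excluded from the candidate set, while the remaining observed values are kept.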
Authors and Affiliations
T Sivapriya, A. R. Banu Kamal, V. Thavavel