Cervical Cancer Prediction through Different Screening Methods using Data Mining
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2019, Vol 10, Issue 2
Abstract
Cervical cancer remains a major cause of death worldwide because effective access to cervical screening is a significant challenge. Data mining techniques, including decision tree algorithms, are used in biomedical research for predictive analysis. The imbalanced dataset used here was obtained from the dataset archive of the University of California, Irvine. The Synthetic Minority Oversampling Technique (SMOTE) was used to balance the dataset, which increased the number of instances. The dataset consists of patient age, number of pregnancies, contraceptive usage, smoking patterns and chronological records of sexually transmitted diseases (STDs). The Microsoft Azure Machine Learning tool was used to simulate the results. This paper focuses on cervical cancer prediction through different screening methods using the boosted decision tree, decision forest and decision jungle algorithms. Performance was evaluated on the basis of the area under the receiver operating characteristic curve (AUROC), accuracy, specificity and sensitivity. 10-fold cross-validation was used to validate the results, and the boosted decision tree gave the best results, reaching an AUROC of 0.978 with the Hinselmann screening method. The results obtained by the other classifiers were significantly worse than those of the boosted decision tree.
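The four evaluation measures named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' Azure Machine Learning pipeline: the function names, the 0.5 decision threshold and the toy labels and scores are assumptions made purely for illustration, and AUROC is computed here through its pairwise-ranking interpretation (the probability that a randomly chosen positive case scores higher than a randomly chosen negative one).

```python
from typing import List, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, tn, fp, fn) for binary labels where 1 is the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy_sensitivity_specificity(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float, float]:
    """Accuracy, sensitivity (true positive rate) and specificity (true negative rate)."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return accuracy, sensitivity, specificity


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos: List[float] = [s for t, s in zip(y_true, scores) if t == 1]
    neg: List[float] = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present in y_true")
    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos) * len(neg))


# Hypothetical toy data (not taken from the paper): 3 positive and 5 negative cases.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1, 0.05]
# Assumed decision threshold of 0.5 to turn scores into class predictions.
toy_preds = [1 if s >= 0.5 else 0 for s in toy_scores]

acc, sens, spec = accuracy_sensitivity_specificity(toy_labels, toy_preds)
area = auroc(toy_labels, toy_scores)
print(f"accuracy={acc:.3f} sensitivity={sens:.3f} specificity={spec:.3f} AUROC={area:.3f}")
```

Accuracy alone can look high on an imbalanced dataset even when few positive cases are detected, which is one reason to read sensitivity and specificity alongside it. AUROC does not depend on the chosen threshold, since it only uses the ranking of the scores.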
Authors and Affiliations
Muhammad Talha Alam, Muhammad Milhan Afzal Khan, Muhammad Atif Iqbal, Abdul Wahab, Mubbashar Mushtaq