Mortality Prediction based on Imbalanced New Born and Perinatal Period Data

Abstract

This study was carried out by the New York State Department of Health, between 2012 and 2016. This experiment relates to six supervised machine learning methods: Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosting (GB), Random Forest (RF), Deep Learning (DL) and the Ensemble Model, all of which are used in the prediction of infant mortality. This experiment applied ensemble model that concentrated on assigning different weights to different models per output class in order to obtain a better predictive performance for infant mortality. Efforts were made to measure the performance and compare the classifier accuracy of each model. Several criteria, including the area under ROC curve, were considered when comparing the ensemble model (GB, RF and DL) with the other five models (SVM, LR, DL, GB and RF). In terms of these different criteria, the ensemble model outperformed the others in predicting survival rates among infant patients given a balanced data set (the areas under the ROC curve for minor, moderate, major and extreme were 98%, 95%, 92% and 97% respectively, giving a total accuracy of 80.65%). For the imbalanced dataset, (the areas under the ROC curve for minor, moderate, major and extreme were 98%, 98%, 99% and 99% respectively, giving total accuracy increased to 97.44%). The results of the experiments used in this dissertation showed that using the ensemble model provided a better level of prediction for infant mortality than the other five models, based on the relative prediction accuracy for each model for each output class. Therefore, the ensemble model provides and extremely promises classifier in terms of predicting infant mortality.

Authors and Affiliations

Wafa M. AlShwaish, Maali Ibr. Alabdulhafith

Keywords

Related Articles

Let’s Code: A Kid-friendly Interactive Application Designed to Teach Arabic-speaking Children Text-based Programming

Programming is the cornerstone for the development of all of the technologies we encounter in our daily lives. It also plays an important role in enhancing creativity, problem-solving, and logical thinking. Due to the im...

Sensitivity Analysis of Fourier Transformation Spectrometer: FTS Against Observation Noise on Retrievals of Carbon Dioxide and Methane

Sensitivity analysis of Fourier Transformation Spectrometer: FTS against observation noise on retrievals of carbon dioxide and methane is conducted. Through experiments with real observed data and additive noise, it is f...

ABJAD Arabic-Based Encryption

The researcher introduced an enhanced classical Arabic-based encryption technique that is essentially designed for Arab nations. The new algorithm uses the shared key technique where the Keyword system Modulus is employe...

Arabic Text Categorization using Machine Learning Approaches

Arabic Text categorization is considered one of the severe problems in classification using machine learning algorithms. Achieving high accuracy in Arabic text categorization depends on the preprocessing techniques used...

The Information-Seeking Problem in Human-Technology Interaction

In the history of information-seeking, the intention of a query and the posed query have some level of distance between them. Because human query-responders are innately connected to times and trends and have the ability...

Download PDF file
  • EP ID EP626542
  • DOI 10.14569/IJACSA.2019.0100808
  • Views 92
  • Downloads 0

How To Cite

Wafa M. AlShwaish, Maali Ibr. Alabdulhafith (2019). Mortality Prediction based on Imbalanced New Born and Perinatal Period Data. International Journal of Advanced Computer Science & Applications, 10(8), 51-60. https://europub.co.uk./articles/-A-626542