Handling Multicollinearity; A Comparative Study Of The Prediction Performance Of Some Methods Based On Some Probabiltiy Distributions
Journal Title: Annals. Computer Science Series - Year 2018, Vol 16, Issue 1
Abstract
This study used some probability distribution (Gamma, Beta and Chi-square distributions) to assess the performance of partial least square regression (PLSR), ridge regression (RR) and LASSO regression (LR) methods. Ordinary Least Squares may fail if the variables are almost collinear or related. As such, this methods (PLSR, RR, AND LR) were compared using simulated data that follows gamma, beta and chi-square distributions with number variables (P=4 and 10) and sample sizes (n=60 and 90). The comparison was carried out using Mean Square Log Error (MSLE), Mean Absolute Error (MAE) and R-Square (R2) which shows that the results of RR is better when P=4 and n=60 using gamma distribution, but using chi square distribution PLRS is better methods. Also, when P=4 and n=90, RR shows better results with both gamma and beta distributions but with chi square distribution all methods have equal predictive ability. However, at P=10 and n=60 RR performed better with both gamma and chi square distributions while when data follows beta distribution all distributions have equal predictive ability. RR shows better results at both gamma and chi square distributions when P=10 and n=90 while PLSR performed better with beta distribution.
Authors and Affiliations
ZAKARI Yahaya ZAKARI, S. A. Yau, U. USMAN
An In-depth Study of Typical Machine Learning Methods via Computational Techniques
The ability to model and perform decision modeling and analysis is an essential feature of many real-world applications ranging from emergency medical treatment in intensive care units to military command and control sys...
Variance Components of Models of Sudoku Square Design
This study aimed at obtaining variance component estimators for all effects of Sudoku square models. The analysis of variance (ANOVA) method was used for the derivation of the variance components for the four Sudoku mode...
Kerberos Authentication in Wireless Sensor Networks
We proposed an authentication mechanism in the wireless sensor network. Sensor network uses the Kerberos authentication scheme for the authentication of bases station in the network. Kerberos provides a centralized authe...
Information Access in the Digital Era
With a fast evolution, a considerable number of applications and global accessibility, the internet is used, nowadays, to gather information from every field of interest, for business dealings, for establishing social re...
Rotation Invariant Skin Detection Approach based on Combination of Probabilistic Distribution Estimation and Single Scale Retinex
Skin detection is one of the main steps in many image processing systems such as face detection, human identicaton, etc. Since now, many methods are proposed to done it accurately. Most of previous methods have tried to...