Handling Multicollinearity; A Comparative Study Of The Prediction Performance Of Some Methods Based On Some Probabiltiy Distributions

Journal Title: Annals. Computer Science Series - Year 2018, Vol 16, Issue 1

Abstract

This study used some probability distribution (Gamma, Beta and Chi-square distributions) to assess the performance of partial least square regression (PLSR), ridge regression (RR) and LASSO regression (LR) methods. Ordinary Least Squares may fail if the variables are almost collinear or related. As such, this methods (PLSR, RR, AND LR) were compared using simulated data that follows gamma, beta and chi-square distributions with number variables (P=4 and 10) and sample sizes (n=60 and 90). The comparison was carried out using Mean Square Log Error (MSLE), Mean Absolute Error (MAE) and R-Square (R2) which shows that the results of RR is better when P=4 and n=60 using gamma distribution, but using chi square distribution PLRS is better methods. Also, when P=4 and n=90, RR shows better results with both gamma and beta distributions but with chi square distribution all methods have equal predictive ability. However, at P=10 and n=60 RR performed better with both gamma and chi square distributions while when data follows beta distribution all distributions have equal predictive ability. RR shows better results at both gamma and chi square distributions when P=10 and n=90 while PLSR performed better with beta distribution.

Authors and Affiliations

ZAKARI Yahaya ZAKARI, S. A. Yau, U. USMAN

Keywords

Related Articles

Evolving Self-Adaptive Genetic Algorithm in Nonlinear Support Vector Machines for Classification Problems

Support Vector Machines (SVM) has shown a range of promising applications on classification problems. In this paper, we propose the genetic algorithm that employs Self-Adaptive Mutation Rate (SAMR) to develop kernel func...

Topology Management for Wireless Mesh Network

This research formulated and simulated a topology management scheme for wireless mesh network (WMN) in areas of scalability and reliability; considering its vast present limitations in commercialization in many applicati...

Reduction of enhanced maintenance effort using ARM model and RMMM plan

Software maintenance effort is playing a very important role for the development of the software. In maintenance phase user request for change and effort required for the maintenance of software is more as compare to the...

Development of an Enhanced AODV Energy Management model and Link Stability in MANET

A mobile ad hoc network (MANET) nodes move arbitrarily and as a result the networks experience a rapid and unpredictable topology changes. The mobile nodes can receive and forward packets as router which leads to superfl...

Computer applications in clinical psychology

The computer-assisted analysis is not currently a novelty, but a necessity in all areas of psychology. A number of studies that examine the limits of the computer assisted and analyzed interpretations, also its advantage...

Download PDF file
  • EP ID EP521332
  • DOI -
  • Views 68
  • Downloads 0

How To Cite

ZAKARI Yahaya ZAKARI, S. A. Yau, U. USMAN (2018). Handling Multicollinearity; A Comparative Study Of The Prediction Performance Of Some Methods Based On Some Probabiltiy Distributions. Annals. Computer Science Series, 16(1), 15-21. https://europub.co.uk./articles/-A-521332