Alternative Estimator for Multivariate Location and Scatter Matrix in the Presence of Outlier

Journal Title: Annals. Computer Science Series - Year 2018, Vol 16, Issue 2

Abstract

It is generally known that in estimating location and scatter matrix of multivariate data when outliers are presents, the method of classical is not robust. The Maximum Likelihood Estimator (MLE) is always very sensitive to some deviations from the assumptions made on the data, especially, presence of outliers. To get over the above stated problem, many alternative estimators that are robust have been proposed in the last decades. Some of these estimators include the Minimum Covariance Determinant (MCD), the Minimum Volume Ellipsoid (MVE), S-Estimators, M-Estimators and Minimum Regularized Covariance Determinant (MRCD) among others. All the methods converged on tackling the problem of robust estimation by finding a sufficiently large subset of the data. In this paper, a robust method of estimating multivariate location and scatter matrix in the presence of outliers is proposed. The proposed estimator is obtained using the best units (samples) from the available data set that satisfied a set of three optimality criteria (CA,CH,CG).The performance of the proposed robust method was compared with two of the existing robust methods (MCD and MVE) and the classical method with their application in Principal component analysis data simulation. The measure of performance used was the Mean Square Errors (MSE) of the characteristic roots (eigen-values) of the variance-covariance matrix. Generally, the proposed alternative method is better than other robust methods and classical method, when the level of magnitude of outliers is small and also performed considerably well with MCD and MVE when the level of magnitude is high at all percentages of outliers.

Authors and Affiliations

Oluwafemi Samuel OBAFEMI, Gafar Matanmi OYEYEMI

Keywords

Related Articles

An Exploratory Study of Critical Factors Affecting the Efficiency of Sorting Techniques (Shell, Heap and Treap)

The efficiency of sorting techniques has a significant impact on the overall efficiency of a program. The efficiency of Shell, Heap and Treap sorting techniques in terms of both running time and memory usage was studied,...

Prior Specification in Bayesian Model Averaging: An application to Economic Growth

Some recent cross-country cross-sectional analyses have employed Bayesian Model Averaging to tackle the issue of model uncertainty. Bayesian model averaging has become an important tool in empirical settings with large n...

Information Access in the Digital Era

With a fast evolution, a considerable number of applications and global accessibility, the internet is used, nowadays, to gather information from every field of interest, for business dealings, for establishing social re...

Handling Multicollinearity; A Comparative Study Of The Prediction Performance Of Some Methods Based On Some Probabiltiy Distributions

This study used some probability distribution (Gamma, Beta and Chi-square distributions) to assess the performance of partial least square regression (PLSR), ridge regression (RR) and LASSO regression (LR) methods. Ordin...

Homogenous Ensembles of Data Mining Algorithms in Predicting Liver Disease

Application of data mining algorithms to medical fields have been of interest as it helps patients get access to a better and faster healthcare. In this study, the effect of homogenous ensemble methods of bagging and boo...

Download PDF file
  • EP ID EP550066
  • DOI -
  • Views 87
  • Downloads 0

How To Cite

Oluwafemi Samuel OBAFEMI, Gafar Matanmi OYEYEMI (2018). Alternative Estimator for Multivariate Location and Scatter Matrix in the Presence of Outlier. Annals. Computer Science Series, 16(2), 130-136. https://europub.co.uk./articles/-A-550066