On the Choice of Linear Regression Algorithms for Biological and Ecological Applications
Journal Title: Annual Research & Review in Biology - Year 2016, Vol 10, Issue 3
Abstract
Model II regression (i.e. minimizing residuals obliquely) is the appropriate alternative to Model I regression by Ordinary Least Squares (i.e. minimizing residuals vertically) when there is no well-established dependence relationship or when x is measured with error. Yet it has no perfect solution. Determining the true slope from errors-in-variables models requires the errors in x and y to be estimated from higher-order moments. However, estimating these accurately requires enormous data sets, so such models are not applicable to most ecological problems. The alternative Reduced Major Axis (RMA) depends on a strict set of assumptions that real data hardly ever meet, making it prone to bias, whereas Principal Components Analysis (PCA) becomes less reliable as the correlation decreases while x and y have similar variances. We used artificial data (which allow the true slope to be known) to demonstrate when RMA or PCA should be preferred. Based on these results, we propose using PCA whenever r² + s²x/s²y is higher than 1.5, where r is the correlation between x and y and s²x and s²y are their variances. Otherwise, we suggest generating artificial data manipulated to match the structure of the original data, and testing which method provides estimates closer to the input true slope. We provide a user-friendly script to perform this task. We tested RMA and PCA with real data on intraspecific and interspecific biomass-density relations in algae and seagrass, algae frond growth, crustacean and bird morphometry, sardine fisheries and the social sciences, commonly finding widely divergent slope estimates that lead to severely biased parameter estimates and model applications. These analyses support the approach for method selection summarized above.
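As a rough illustration of the selection rule stated in the abstract, the sketch below computes the OLS, RMA and PCA (major axis) slopes and the quantity r² + s²x/s²y. It is not the authors' script: the function name `model2_slopes` and the returned keys are hypothetical, and the slope formulas are the standard textbook ones (RMA slope as sign(r)·s_y/s_x, PCA slope from the first eigenvector of the covariance matrix), assumed here to match the definitions used in the paper.

```python
import numpy as np


def model2_slopes(x, y):
    """Return OLS, RMA and PCA (major axis) slopes plus the selection criterion.

    Hypothetical helper for illustration only; assumes x and y are 1-D,
    of equal length, and that their covariance is not zero.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    sxx = np.var(x, ddof=1)            # sample variance of x
    syy = np.var(y, ddof=1)            # sample variance of y
    sxy = np.cov(x, y, ddof=1)[0, 1]   # sample covariance of x and y
    r = sxy / np.sqrt(sxx * syy)       # Pearson correlation

    ols = sxy / sxx                    # Model I: vertical residuals
    rma = np.sign(r) * np.sqrt(syy / sxx)
    # Major axis slope, i.e. direction of the first principal component.
    pca = (syy - sxx + np.sqrt((syy - sxx) ** 2 + 4.0 * sxy ** 2)) / (2.0 * sxy)

    criterion = r ** 2 + sxx / syy     # r^2 + s2x/s2y from the abstract
    return {
        "ols": ols,
        "rma": rma,
        "pca": pca,
        "criterion": criterion,
        "prefer_pca": bool(criterion > 1.5),
    }


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=200)
    y = 0.6 * x + rng.normal(scale=0.3, size=200)
    print(model2_slopes(x, y))
```

When `prefer_pca` is false, the abstract recommends a simulation step (artificial data with a known slope and the same structure as the original) to decide between RMA and PCA; that step is not reproduced here.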
Authors and Affiliations
Vasco M. N. C. S. Vieira, Joel Creed, Ricardo A. Scrosati, Anabela Santos, Georg Dutschke, Francisco Leitão, Aschwin H. Engelen, Oscar R. Huanel, Marie-Laure Guillemin, Marcos Mateus, Ramiro Neves