Optimization Techniques for SCAD Variable Selection in Medical Research
Journal Title: Biomedical Journal of Scientific & Technical Research (BJSTR) - Year 2018, Vol 8, Issue 2
Abstract
High-dimensional data analysis requires variable selection to identify truly relevant variables. More often it is done implicitly via regularization, such as penalized regression. Of the many versions of penalties, SCAD has shown good properties and has been widely adopted in medical research and many more areas. This paper reviews the various optimization techniques in solving SCAD penalized regression. High-dimensional data analysis has been a common and important topic in biomedical/genomic/clinical studies. For example, the identification of genetic factors for complex diseases such as lung cancer implicates a variety of genetic variants. For high-dimensional data, there is the well-known problem of curse of dimensionality arising in modeling. Therefore, variable selection is a fundamental task for high-dimensional statistical modeling. The "old school" way of doing variable selection is to follow a subset selection procedure prior to building the model of interest. The procedure commonly adopts AIC/BIC as evaluation metric and often iterates in a stepwise fashion. Yet this is independent of the subsequent modeling task hence the effectiveness might be less desirable. A more natural way is to integrate the variable selection into the modeling itself, i.e., the penalized regression, which simultaneously performs variable selection and coefficient estimation. Theoretically, the "best" penalty for the penalized regression is the number of non-zero variables, to push as many variables to zero as possible. Yet, it is well known that the L0 (also known as the entropy penalty) optimization [1] is infeasible. As such, the L1 (LASSO) penalty Tibshirani [2] is our "next best" candidate, which is widely adopted in statistical and machine learning community for sparse solutions. However, [3] point out that L1 suffers the problem of biasedness. They propose the Smoothly Clipped Absolute Deviation (SCAD) penalty that can produce unbiased estimates while retaining good properties of L1. Subsequently, the SCAD penalty function has seen a wide range of applications including medical/clinical research, such as [1,4-7]. Nevertheless, the estimating procedure for SCAD penalized regression is no trivial task, because the target function a) is a high-dimensional non-concave function, b) is singular at the origin, c) does not have continuous second order derivatives.
Authors and Affiliations
Yan Fang, Yan Yan Kong, Yumei Jiao
Toxoplasmosis Frequency in Pregnant Women Attending Regional Health Centers in the City of Samambaia of the Federal District, Brazil
Toxoplasmosis is caused by T. gondii, with prevalence from 20 to 90% in the world population. In pregnant women, the presence of fetal infection reaches 40%. The aim of the study was to determine the frequency and risk f...
Various Improved Approximations to Distributions of Quadratic Test Statistics for Dependent Rank Sums
This paper presents a modified chi-square approximation to the distribution of test statistics arising from multivariate ranked data. The modification arises from an improvement to the estimated variance matrix of the re...
FC for Relapsed/Refractory Diffuse Large B Cell Lymphoma
Despite overall improvements, relapsed/refractory diffuse large B-cell lymphoma (DLBCL)remains a major cause of morbidity [1]. Treatment option for relapsed/refractory DLBCL is salvage chemotherapy followed by an autolog...
Hypersensitivity to Intravenous Ondansetron
A 33 year old male who was recovering after cholecystectomy presented a sudden onset of breathing difficulty and rashes all over the body with inj. Andtron (Ondansetron -4mg). The condition progressed to cardio pulmonary...
Extraction Optimization for Phenols and Flavonoids from Cultured Mycelia of Cordyceps Ophioglossoides and Exploration of Bioactivities of its Aqueous and Ethanol Extracts
Aqueous and ethanol extracts from the mycelia of Cordyceps Ophioglossoides have been used as a nutritional supplement, especially for women suffering from massive postpartum vaginal bleeding in Southwest China. However,...