Document Type
Lecture
Publication Date
10-8-2004
Abstract
Variable selection in the linear regression model takes many apparent faces from both frequentist and Bayesian standpoints. We introduce a variable selection method referred to as a rescaled spike and slab model. We study the importance of prior hierarchical specifications and draw connections to frequentist generalized ridge regression estimation. Specifically, we study the usefulness of continuous bimodal priors to model hypervariance parameters, and the effect scaling has on the posterior mean through its relationship to penalization. We demonstrate the importance of selective shrinkage for effective variable selection in terms of risk misclassification, and show this is achieved using the posterior from a rescaled spike and slab model. The method is illustrated using simulated data and a DNA microarray analysis where we search for differentially expressing genes across samples collected from different stages of colon cancer to liver metastasis. The common feature of the illustrations is the high dimensionality of the predictor space.
Relational Format
presentation
Recommended Citation
Rao, J. Sunil, "Spike and slab variable selection: frequentist and Bayesian strategies (in DNA microarray data analysis)" (2004). Probability & Statistics Seminar. 58.
https://egrove.olemiss.edu/math_statistics/58