Document Type

Article

Publication Date

2016

Abstract

This study presents comparisons of subset selection criteria used to help determine the "best" regression model in multiple linear regression. No such criteria can replace a researcher's knowledge of theory to help choose useful models, but such criteria may help in exploratory research. Relationships among variables can be more complicated than expected and may require adjustment to theory based on empirical models. Knowing how well each subset selection method performs can be useful in such cases. Monte Carlo simulations were performed to compare a number of well-known criteria (e.g., AIC, BIC, PRESS, adjusted R² ) to some less well-known criteria (e.g., AICc, AICu, GCV, cross-validity R²). We found that although none of the criteria work well to identify a single correct model across a large number of coefficient and multicollinearity patterns, AIC and adjusted R² work reasonably well enough to recommend, in combination, to identify a model within-one of the correct model.