Model selection in linear regression using paired bootstrap


Rabbi F., Khan S., Khalil A., Mashwani W. K. , Shafiq M., Göktaş P., ...More

Communications in Statistics - Theory and Methods, vol.50, no.7, pp.1629-1639, 2021 (Journal Indexed in SCI Expanded) identifier

  • Publication Type: Article / Article
  • Volume: 50 Issue: 7
  • Publication Date: 2021
  • Doi Number: 10.1080/03610926.2020.1725829
  • Title of Journal : Communications in Statistics - Theory and Methods
  • Page Numbers: pp.1629-1639

Abstract

© 2020 Taylor & Francis Group, LLC.Model selection is an important and challenging problem in statistics. The model selection is inevitable in a large number of applications including life sciences, social sciences, business, or economics. In this article, we propose a resampling-based information criterion called paired bootstrap criterion (PBC) for model selection. The proposed criterion is based on minimizing the conditional expected prediction loss for selecting the best subset of variables. We estimate the conditional expected prediction loss by using the out-of-bag (OOB) bootstrap approach. Other classical criteria for model selection such as AIC, BIC are also presented for comparison purpose. We demonstrate that the proposed paired bootstrap model selection criterion is effective in selecting accurate models via real and simulated data examples. The results confirm the satisfactory behavior of the proposed model selection criterion to select parsimonious models that fit the data well. We apply the proposed methodology to a real data example.