Variable Selection and Accurate Predictions in Habitat Modelling: a Shrinkage Approach

Type Article
Date 2017-04
Language English
Author(s) Authier Matthieu1, 5, Saraux Claire2, Peron Clara3, 4, 5
Affiliation(s) 1 : Univ La Rochelle, Observ PELAGIS UMS 3462, La Rochelle, France.
2 : Ifremer Inst Francais Rech Exploitat Mer, UMR MARBEC, Sete, France.
3 : Univ Tasmania, Inst Marine & Antarctic Studies, Kingston, Tas, Australia.
4 : Australian Antarctic Div, Kingston, Tas, Australia.
5 : Ctr Ecol Fonct & Evolut, Ecol Spatiale Populat, Montpellier, France.
Source Ecography (0906-7590) (Wiley), 2017-04 , Vol. 40 , N. 4 , P. 549-560
DOI 10.1111/ecog.01633
WOS© Times Cited 13
Abstract Habitat modelling is increasingly relevant in biodiversity and conservation studies. A typical application is to predict potential zones of specific conservation interest. With many environmental covariates, a large number of models can be investigated but multi-model inference may become impractical. Shrinkage regression overcomes this issue by dealing with the identification and accurate estimation of effect size for prediction. In a Bayesian framework we investigated the use of a shrinkage prior, the Horseshoe, for variable selection in spatial generalized linear models (GLM). As study cases, we considered 5 datasets on small pelagic fish abundance in the Gulf of Lion (Mediterranean Sea, France) and 9 environmental inputs. We compared the predictive performances of a simple kriging model, a full spatial GLM model with independent normal priors for regression coefficients, a full spatial GLM model with a Horseshoe prior for regression coefficients and 2 zero-inflated models (spatial and non-spatial) with a Horseshoe prior. Predictive performances were evaluated by cross-validation on a hold-out subset of the data: models with a Horseshoe prior performed best, and the full model with independent normal priors worst. With an increasing number of inputs, extrapolation quickly became pervasive as we tried to predict from novel combinations of covariate values. By shrinking regression coefficients with a Horseshoe prior, only one model needed to be fitted to the data in order to obtain reasonable and accurate predictions, including extrapolations.
Full Text
File Pages Size Access
Publisher's official version 12 12 MB Open access
Appendix 1. 25 6 MB Open access
Top of the page