Dealing with area‐to‐point spatial misalignment in species distribution models

Type Article
Acceptance Date 2024-03-22 IN PRESS
Language English
Author(s) Mourguiart BastienORCID1, 2, Chevalier MathieuORCID1, Marzloff MartinORCID1, Caill‐milly NathalieORCID3, Mengersen KerrieORCID2, 4, 5, Liquet BenoitORCID2, 6
Affiliation(s) 1 : Ifremer, DYNECO Plouzané, France
2 : Laboratoire de Mathématiques et de Leurs Applications, Université de Pau et des Pays de l'Adour, E2S UPPA, CNRS Anglet, France
3 : Ifremer, LITTORAL Anglet, France
4 : ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology Brisbane QLD ,Australia
5 : School of Mathematical Science, Queensland University of Technology Brisbane QLD, Australia
6 : School of Mathematical and Physical Sciences, Macquarie University Sydney NSW ,Australia
Source Ecography (0906-7590) (Wiley) In Press
DOI 10.1111/ecog.07104
Keyword(s) grain size, spatial misalignment, spatial resolution, spatial scale, species distribution modelling, species–environment relationships (SERs)
Abstract

Species distribution models (SDMs) are extensively used to estimate species–environment relationships (SERs) and predict species distribution across space and time. For this purpose, it is key to choose relevant spatial grains for predictor and response variables at the onset of the modelling process. However, environmental variables are often derived from large‐scale climate models at a grain that can be coarser than the one of the response variable. Such area‐to‐point spatial misalignment can bias estimates of SER and jeopardise the robustness of predictions. We used a virtual species approach, running simulations across different levels of area‐to‐point spatial misalignment to seek statistical solutions to this problem. We specifically compared accuracy of SER estimates and predictive performances, assessed across different degrees of spatial heterogeneity in environmental conditions, of three SDMs: a GLM, a spatial GLM and a Berkson error model (BEM) that accounts for fine‐grain environmental heterogeneity within coarse‐grain cells. Only the BEM accurately estimates SER from relatively coarse‐grain environmental data (up to 50 times coarser than the response grain), while the two GLMs provide flattened SER. However, all three models perform poorly when predicting from coarse‐grain data, particularly in environments that are more heterogeneous than the training conditions. Conversely, decreasing environmental heterogeneity relative to the training dataset reduces the predictive biases. Because predictions are made from covariate‐grain data, the BEM displays lower predictive performance than the two GLMs. Thus, standard model selection methods would fail to select the model that best estimates SERs (here, the BEM), which could lead to false interpretations about the environmental drivers of species distributions. Overall, we conclude that the BEM, because it can robustly estimate SER at the response grain, holds great promise to overcome area‐to‐point misalignment.

Licence CC-BY
Full Text
File Pages Size Access
Publisher's official version IN PRESS 14 12 MB Open access
Supplementary Material 12 779 KB Open access
Top of the page