A joint model for the estimation of species distributions and environmental characteristics from point-referenced data

PLoS One. 2024 Jun 21;19(6):e0304942. doi: 10.1371/journal.pone.0304942. eCollection 2024.

Abstract

Background: Predicting and explaining species occurrence using environmental characteristics is essential for nature conservation and management. Species distribution models consider species occurrence as the dependent variable and environmental conditions as the independent variables. Suitable conditions are estimated based on a sample of species observations, where one assumes that the underlying environmental conditions are known. This is not always the case, as environmental variables at broad spatial scales are regularly extrapolated from point-referenced data. However, treating the predicted environmental conditions as accurate surveys of independent variables at a specific point does not take into account their uncertainty.

Methods: We present a joint hierarchical Bayesian model where models for the environmental variables, rather than a set of predicted values, are input to the species distribution model. All models are fitted together based only on point-referenced observations, which results in a correct propagation of uncertainty. We use 50 plant species representative of the Dutch flora in natural areas with 8 soil condition predictors taken during field visits in the Netherlands as a case study. We compare the proposed model to the standard approach by studying the difference in associations, predicted maps, and cross-validated accuracy.

Findings: We find that there are differences between the two approaches in the estimated association between soil conditions and species occurrence (correlation 0.64-0.84), but the predicted maps are quite similar (correlation 0.82-1.00). The differences are more pronounced in the rarer species. The cross-validated accuracy is substantially better for 5 species out of the 50, and the species can also help to predict the soil characteristics. The estimated associations tend to have a smaller magnitude with more certainty.

Conclusion: These findings suggests that the standard model is often sufficient for prediction, but effort should be taken to develop models which take the uncertainty in the independent variables into account for interpretation.

MeSH terms

  • Bayes Theorem*
  • Conservation of Natural Resources / methods
  • Ecosystem
  • Environment
  • Models, Biological
  • Netherlands
  • Plants
  • Soil / chemistry

Substances

  • Soil

Grants and funding

This research was carried out in the framework of the Strategic Program RIVM (SPR), in which expertise and innovative projects prepare RIVM to respond to future issues in health, the environment and sustainability. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.