Integrating uncertain prior knowledge regarding ecological preferences into multi-species distribution models: Effects of model complexity on predictive performance

2020 
Abstract Species distribution models (SDMs) are often criticised for lacking explicit linkage to ecological concepts. We aim to improve the ecological basis of SDMs by integrating prior knowledge about ecological preferences of organisms. Additionally, we aim to support a systematic, data-driven review of such prior knowledge by confronting it with independent monitoring data using Bayesian inference. We developed a series of multi-species distribution models (MSDMs) with increasing complexity to predict the probability of occurrence of taxa at sampling sites based on habitat suitability functions that are parameterized with prior ecological knowledge. We subsequently assessed the models` predictive performance with 3-fold cross-validation. So far, if ecological preferences or functional traits have been used in SDMs, they were mainly used as fixed inputs without considering their uncertainty. We take the additional step of considering uncertainty about preference parameters by including them as uncertain prior information that is subsequently updated with Bayesian inference. We apply the series of models in a case study on macroinvertebrates in Swiss streams. We analyse differences in the quality of fit, changes in predictive performance, and the potential to learn about the parameters from the data. We consider ecological preferences for natural and human modified environmental factors including temperature, flow velocity, organic matter concentration, insecticide pollution, and substratum. Results indicate that updating prior knowledge on ecological preferences with Bayesian inference, rather than using it as fixed input, improves model fit and predictive performance. For example, the predictive performance measured by the deviance for validation data improves by 17 % and the explanatory power increases 3.8 times from a model that treats ecological preferences as fixed scores to a model that treats them as uncertain parameters. The spatial distribution of many taxa, including rare taxa with frequencies of occurrence down to about 5 %, which are difficult to model with SDMs that do not consider prior information, can be captured by the new models. Integrating prior knowledge as uncertain parameters in a Bayesian framework establishes ecological interpretable links between taxa and their environment and supports a systematic revision and complementation of databases on ecological preferences, even in case of poor or missing prior knowledge. Model outputs need to be carefully interpreted by modellers and experts on ecological preferences. Increased exchange between these research fields will benefit further integration of ecological preferences into SDMs.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    52
    References
    5
    Citations
    NaN
    KQI
    []
    Baidu
    map