2020 ESA Annual Meeting (August 3 - 6)

COS 3 Abstract - Correcting for spatial sampling bias in biological records data and modeling millipede distributions

Willson Gaul1, Dinara Sadykova2, Niall Keogh3, Hannah J. White1, Lupe León-Sánchez2, Paul Caplat2, Mark C. Emmerson2, Tomás E. Murray4 and Jon M. Yearsley1, (1)School of Biology and Environmental Sciences, University College Dublin, Dublin, Ireland, (2)School of Biological Sciences, Queen's University Belfast, Belfast, United Kingdom, (3)Marine and Freshwater Research Centre, Galway-Mayo Institute of Technology, Galway, Ireland, (4)National Biodiversity Data Centre, Waterford, Ireland
Background/Question/Methods

Biological records (citizen science observations of species) provide information about species occurrences over large spatial scales, but the data are spatially patchy with lots of data from some locations and no data from others (spatially biased data). Species distribution models (SDMs) can “fill in” unsampled locations by predicting which species are present, but only if models can make good predictions despite being trained with spatially biased data. We tested the effect of spatial sampling bias, sample size, and the choice of species distribution modeling method on predictions of species distribution models when applied at the spatial scale of Ireland. We present two lines of evidence, a simulation study and a study in which we manipulated biological records of birds in Ireland to introduce spatial biases similar to those found in data for other taxonomic groups. Based on those results, we produced species distribution models for Ireland that fill in knowledge gaps for millipedes (Diplopoda), a taxonomic group for which sampling in Ireland is spatially patchy.

Results/Conclusions

1) Spatial bias at an intensity comparable to that found in average Irish biological records datasets did not hurt predictive performance of SDMs compared to when models were trained with minimally biased data.

2) Increasing the total amount of data is more important than increasing the spatial evenness of data when the goal is to predict species occurrence using species distribution modeling.

3) Species distribution models for selected millipede species showed adequate performance, but sample size may still limit our ability to model many millipede species in Ireland. For the purpose of predictive species distribution modeling, increasing the total number of millipede records is likely more important than increasing the spatial evenness of recording.