Nearest neighbour distance matching Leave-One-Out Cross-Validation for map validation

dc.contributor.authorMilà, Carles
dc.contributor.authorMateu, Jorge
dc.contributor.authorPebesma, Edzer
dc.contributor.authorMeyer, Hannah V.
dc.date.accessioned2022-12-13T06:56:34Z
dc.date.available2022-12-13T06:56:34Z
dc.date.issued2022
dc.description.abstractSeveral spatial and non-spatial Cross-Validation (CV) methods have been used to perform map validation when additional sampling for validation purposes is not possible, yet it is unclear in which situations one CV method might be preferred over the other. Three factors have been identified as determinants of the performance of CV methods for map validation: the prediction area (geographical interpolation vs. extrapolation), the sampling pattern and the landscape spatial autocorrelation. In this study, we propose a new CV strategy that takes the geographical prediction space into account, and test how the new method compares with other established CV methods under different configurations of these three factors. We propose a variation of Leave-One-Out (LOO) CV for map validation, called Nearest Neighbour Distance Matching (NNDM) LOO CV, in which the nearest neighbour distance distribution function between the test and training data during the CV process is matched to the nearest neighbour distance distribution function between the target prediction and training points. Using random forest as a machine learning algorithm, we then examine the suitability of NNDM LOO CV as well as the established LOO (non-spatial) and buffered-LOO (bLOO, spatial) CV methods in two simulations with varying prediction areas, landscape autocorrelation and sampling distributions. LOO CV provided good map accuracy estimates in landscapes with short autocorrelation ranges, or when estimating geographical interpolation map accuracy with randomly distributed samples. bLOO CV yielded realistic error estimates when estimating map accuracy in new prediction areas, but generally overestimated geographical interpolation errors. NNDM LOO CV returned reliable estimates in all scenarios we considered. 
While LOO and bLOO CV provided reliable map accuracy estimates only in certain situations, our newly proposed NNDM LOO CV method returned robust estimates and generalised to LOO and bLOO CV whenever these methods were the most appropriate approach. Our work recognises the necessity of considering the geographical prediction space when designing CV-based methods for map validation.
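As an illustration of the two quantities the abstract compares, the following is a minimal sketch rather than the authors' implementation. It computes the empirical nearest neighbour distance distribution between test and training points under plain LOO CV, and between prediction points and training points. The function names, the simulated coordinates and the distance thresholds are all hypothetical, and the matching step that defines NNDM itself is not shown because the abstract does not describe it.

```python
import math
import random


def nn_distances_loo(train):
    """Distance from each training point to its nearest *other* training point.

    Under plain LOO CV each held-out point is predicted from all remaining
    points, so these are the test-to-train nearest neighbour distances.
    """
    return [
        min(math.dist(p, q) for j, q in enumerate(train) if j != i)
        for i, p in enumerate(train)
    ]


def nn_distances_pred(pred, train):
    """Distance from each prediction point to its nearest training point."""
    return [min(math.dist(p, q) for q in train) for p in pred]


def ecdf(distances, r):
    """Empirical share of nearest neighbour distances that are <= r."""
    return sum(d <= r for d in distances) / len(distances)


random.seed(0)
# Hypothetical data: samples clustered in one corner of the unit square,
# prediction points spread over the whole square.
train = [(random.uniform(0, 0.3), random.uniform(0, 0.3)) for _ in range(50)]
pred = [(random.random(), random.random()) for _ in range(500)]

g_loo = nn_distances_loo(train)
g_pred = nn_distances_pred(pred, train)

# Compare the two empirical distribution functions at a few distances.
for r in (0.05, 0.2, 0.5):
    print(f"r={r}: LOO {ecdf(g_loo, r):.2f}, prediction {ecdf(g_pred, r):.2f}")
```

With clustered samples like these, the LOO curve rises much faster than the prediction curve: held-out points sit closer to training data than most prediction locations do. This is the kind of mismatch that, according to the abstract, NNDM LOO CV is designed to remove.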
dc.format.mimetypeapplication/pdf
dc.identifier.citationMilà C, Mateu J, Pebesma E, Meyer H. Nearest neighbour distance matching Leave-One-Out Cross-Validation for map validation. Methods in Ecology and Evolution. 2022;13(6):1304-16. DOI: 10.1111/2041-210X.13851
dc.identifier.doihttp://dx.doi.org/10.1111/2041-210X.13851
dc.identifier.issn2041-210X
dc.identifier.urihttp://hdl.handle.net/10230/55108
dc.language.isoeng
dc.publisherWiley
dc.relation.ispartofMethods in Ecology and Evolution. 2022;13(6):1304-16
dc.rights© 2022 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.keywordCross-Validation
dc.subject.keywordMap accuracy estimation
dc.subject.keywordMap validation
dc.subject.keywordSpatial point patterns
dc.subject.keywordSpatial prediction
dc.titleNearest neighbour distance matching Leave-One-Out Cross-Validation for map validation
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Name:
Mila_mee_near.pdf
Size:
5.83 MB
Format:
Adobe Portable Document Format