Please use this identifier to cite or link to this item: http://dx.doi.org/10.25673/118039
Title: Effect of training sample size, sampling design and prediction model on soil mapping with proximal sensing data for precision liming
Author(s): Schmidinger, Jonas
Schröter, Ingmar
Bönecke, EricLook up in the Integrated Authority File of the German National Library
Gebbers, RobinLook up in the Integrated Authority File of the German National Library
Ruehlmann, Joerg
Kramer, EckartLook up in the Integrated Authority File of the German National Library
Mulder, Vera L.
Heuvelink, Gerard B. M.Look up in the Integrated Authority File of the German National Library
Vogel, SebastianLook up in the Integrated Authority File of the German National Library
Issue Date: 2024
Type: Article
Language: English
Abstract: Site-specific estimation of lime requirement requires high-resolution maps of soil organic carbon (SOC), clay and pH. These maps can be generated with digital soil mapping models fitted on covariates observed by proximal soil sensors. However, the quality of the derived maps depends on the applied methodology. We assessed the effects of (i) training sample size (5–100); (ii) sampling design (simple random sampling (SRS), conditioned Latin hypercube sampling (cLHS) and k-means sampling (KM)); and (iii) prediction model (multiple linear regression (MLR) and random forest (RF)) on the prediction performance for the above mentioned three soil properties. The case study is based on conditional geostatistical simulations using 250 soil samples from a 51 ha field in Eastern Germany. Lin’s concordance correlation coefficient (CCC) and root-mean-square error (RMSE) were used to evaluate model performances. Results show that with increasing training sample sizes, relative improvements of RMSE and CCC decreased exponentially. We found the lowest median RMSE values with 100 training observations i.e., 1.73%, 0.21% and 0.3 for clay, SOC and pH, respectively. However, already with a sample size of 10, models of moderate quality (CCC > 0.65) were obtained for all three soil properties. cLHS and KM performed significantly better than SRS. MLR showed lower median RMSE values than RF for SOC and pH for smaller sample sizes, but RF outperformed MLR if at least 25–30 or 75–100 soil samples were used for SOC or pH, respectively. For clay, the median RMSE was lower with RF, regardless of sample size.
URI: https://opendata.uni-halle.de//handle/1981185920/119998
http://dx.doi.org/10.25673/118039
Open Access: Open access publication
License: (CC BY 4.0) Creative Commons Attribution 4.0(CC BY 4.0) Creative Commons Attribution 4.0
Journal Title: Precision agriculture
Publisher: Springer Science + Business Media B.V
Publisher Place: Dordrecht [u.a.]
Volume: 25
Original Publication: 10.1007/s11119-024-10122-3
Page Start: 1529
Page End: 1555
Appears in Collections:Open Access Publikationen der MLU

Files in This Item:
File Description SizeFormat 
s11119-024-10122-3.pdf2.72 MBAdobe PDFThumbnail
View/Open