Spectral variable selection for estimation of soil organic carbon content using mid-infrared spectroscopy

Wang, J., Liu, T., Zhang, J., Yuan, H. and Acquah, Gifty (2022) Spectral variable selection for estimation of soil organic carbon content using mid-infrared spectroscopy. European Journal of Soil Science, 73 (4). e13267. 10.1111/ejss.13267

The non-destructive and rapid estimation of soil organic carbon (SOC) content with mid-infrared spectroscopy (MIR, 4000–400 cm⁻¹) will play a vital role in precision agriculture. However, the benefit derived from the full MIR range is compromised by multicollinearity and noise. Hence, variable selection methods have been developed to reduce the full spectrum to a few variables that contribute the most information to a property of interest. However, only a few studies have applied variable selection methods in the MIR region to estimate soil organic carbon content. Therefore, four variable selection methods, namely stability competitive adaptive reweighted sampling (sCARS), bootstrapping soft shrinkage (BOSS), interval combination optimization (ICO) and the interval combination optimization-successive projections algorithm (ICO-SPA), were investigated to determine which method identified the wavebands most sensitive to SOC and thereby improved prediction accuracy. The selected variables (i.e., the reduced spectrum), as well as the full MIR spectrum, were coupled with partial least squares regression (PLSR) to calibrate SOC models. The results showed that the models based on variable selection achieved higher prediction accuracy than the full-spectrum model. sCARS selected 19 variables, BOSS selected 21 variables and ICO selected 311 variables, whereas ICO-SPA selected only 9 variables (accounting for 0.38% of all variables). Despite using the fewest variables, the ICO-SPA-PLSR model achieved prediction accuracy similar to that of the models built with the other three variable selection methods. The ICO-SPA-PLSR model had an R²p value of 0.93, an RPD of 3.90 and an RMSEP of 0.13%. This method identified 3450, 2920, 2767, 2000, 1800, 1765, 1600, 1560 and 927 cm⁻¹ as the feature wavenumbers carrying the most useful information for accurately estimating SOC content. Therefore, the strategy of combining interval (ICO) and individual (SPA) variable selection may be a good alternative variable selection method for MIR spectroscopic data.
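As a rough illustration of how a reduced spectrum and the reported accuracy statistics fit together, the sketch below maps the nine ICO-SPA feature wavenumbers onto the columns of a spectral axis and computes R², RMSEP and RPD for a set of predictions. It is not the authors' code: the function names, the 2 cm⁻¹ axis spacing and the metric conventions (RPD taken as the sample standard deviation of the reference values divided by RMSEP) are assumptions made for the example, and the PLSR fit itself is left out.

```python
import math
import statistics

# Feature wavenumbers (cm^-1) reported for ICO-SPA in the abstract.
ICO_SPA_WAVENUMBERS = [3450, 2920, 2767, 2000, 1800, 1765, 1600, 1560, 927]


def nearest_indices(axis, targets):
    """For each target wavenumber, return the index of the closest axis point.

    Ties go to the first (lowest-index) candidate.
    """
    return [min(range(len(axis)), key=lambda i: abs(axis[i] - t)) for t in targets]


def prediction_metrics(y_true, y_pred):
    """Return R2, RMSEP and RPD for a prediction set (assumed definitions)."""
    n = len(y_true)
    sse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    mean_true = sum(y_true) / n
    sst = sum((t - mean_true) ** 2 for t in y_true)
    rmsep = math.sqrt(sse / n)               # root mean square error of prediction
    r2 = 1.0 - sse / sst                     # coefficient of determination
    rpd = statistics.stdev(y_true) / rmsep   # sample SD of references / RMSEP
    return {"r2": r2, "rmsep": rmsep, "rpd": rpd}


# Hypothetical spectral axis: 4000 down to 400 cm^-1 in 2 cm^-1 steps (assumed).
axis = [4000 - 2 * i for i in range(1801)]
selected_columns = nearest_indices(axis, ICO_SPA_WAVENUMBERS)
```

The indices in `selected_columns` could then be used to slice a spectra matrix before fitting a PLSR model, and `prediction_metrics` applied to the measured and predicted SOC values of the validation samples.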

Available under Creative Commons: Attribution 4.0
