Spectral variable selection for estimation of soil organic carbon content using mid-infrared spectroscopy

A - Papers appearing in refereed journals

Wang, J., Liu, T., Zhang, J., Yuan, H. and Acquah, G. 2022. Spectral variable selection for estimation of soil organic carbon content using mid-infrared spectroscopy. European Journal of Soil Science. 73 (4), p. e13267. https://doi.org/10.1111/ejss.13267

AuthorsWang, J., Liu, T., Zhang, J., Yuan, H. and Acquah, G.
Abstract

The non-destructive and rapid estimation of soil total carbon (SOC) content with mid-infrared spectroscopy (MIR, 4000–400 cm−1) will play a vital role in precision agriculture. However, the benefit derived from the full MIR range is compromised by multicollinearity and noise. Hence, variable selection methods have been developed to reduce the full spectrum to a few variables that contribute the most information to a property of interest. However, only a few studies have applied variable selection methods in the MIR region to estimate organic carbon content in the soil. Therefore, four variable selection methods, namely stability competitive adaptive reweighted sampling (sCARS), bootstrapping soft shrinkage (BOSS), interval combination optimization (ICO) and the interval combination optimization-successive projections algorithm (ICO-SPA) method were investigated to ascertain the method that identified the wavebands most sensitive to SOC in order to improve prediction accuracy. The selected variables (i.e., reduced spectrum), as well as the full MIR spectrum, were coupled with partial least squares regression (PLSR) for the model calibration of SOC. The results showed that the models based on variable selection achieved higher prediction accuracy than the full spectrum model. sCARS selected 19 variables, BOSS selected 21 variables, ICO selected 311 variables, whereas ICO-SPA selected only 9 variables (accounting for 0.38% of all variables), while the prediction accuracy of the ICO-SPA-PLSR model was similar to those of the other three variable selection methods. The ICO-SPA-PLSR model had an Rp2 value of 0.93, RPD of 3.90 and RMSEP of 0.13%. This method identified 3450, 2920, 2767, 2000, 1800, 1765, 1600, 1560 and 927 cm−1 as the feature wavenumbers with the most useful information in accurately estimating SOC content. Therefore, the combination strategy of interval (ICO) and individual (SPA) variable selection may be a good alternative variables selection method for MIR spectroscopic data.

Year of Publication2022
JournalEuropean Journal of Soil Science
Journal citation73 (4), p. e13267
Digital Object Identifier (DOI)https://doi.org/10.1111/ejss.13267
Open accessPublished as green open access
FunderBiotechnology and Biological Sciences Research Council
National Key Research and Development Program of China
Output statusPublished
Publication dates
Online16 Jun 2022
PublisherWiley
ISSN1351-0754

Permalink - https://repository.rothamsted.ac.uk/item/98987/spectral-variable-selection-for-estimation-of-soil-organic-carbon-content-using-mid-infrared-spectroscopy

Restricted files

Publisher's version

Under embargo indefinitely

Accepted author manuscript

Under embargo indefinitely

103 total views
17 total downloads
0 views this month
0 downloads this month