土壤氧化铁的特征波长选择和高光谱反演

Characteristic wavelength selection and hyperspectral inversion of soil iron oxide

  • 摘要: 传统的氧化铁特征波长选取方法单一, 多仅利用相关系数法(CC), 建模输入变量较多导致预测精度不高。为提高模型预测精度, 在云南禄丰恐龙谷南缘山地采集135个5~20 cm表层土壤样品, 室内测定样品的光谱反射率和氧化铁含量。对土壤光谱曲线进行Savizky-Golay平滑后作为原始光谱曲线(OR), 对原始光谱曲线进行一阶微分(FD)和倒数的对数(RL)变换; 通过与氧化铁含量的相关性分析, 利用迭代保留信息变量(IRIV)、竞争性自适应重加权算法(CARS)和连续投影法(SPA)算法提取特征波长, 把提取的特征波长作为自变量, 氧化铁含量作为因变量, 分别用随机森林回归(RF)和偏最小二乘回归(PLSR)进行反演模型的构建。结果表明, 相关性分析结合IRIV、CARS和SPA算法提取特征波长的方法可以有效降低建模波长数; 通过结合不同的光谱变换方法和建模方法, 发现RL-CC-CARS-PLSR模型的效果最好, 其建模集R2为0.833, RMSE为4.361 g·kg−1, 验证集R2为0.826, RMSE为5.600 g·kg−1, RPIQ达3.618, 模型具有很好的稳定性和预测能力。该研究为利用高光谱反演土壤氧化铁含量提供了参考。

     

    Abstract: Understanding the soil content of iron oxide is of great significance in revealing the soil environment and guiding crop production. Hyperspectral is widely used for inversion of soil physical and chemical properties owing to its high spectral resolution and strong wavelength continuity. However, hyperspectral data have the disadvantage of wavelength redundancy; therefore, the selection of characteristic wavelengths is a necessary step for soil hyperspectral modeling and analysis. The traditional method for selecting the characteristic wavelength of soil iron oxide is single, and only correlation coefficient (CC) method is used to select characteristic wavelengths. Hence, there are too many input variables in the model, which reduces its prediction accuracy. The purpose of this study was to explore the predictive ability of different models constructed by using correlation analysis combined with various characteristic wavelength selection algorithms for soil iron oxide content and compare the accuracy of the models constructed by using the correlation coefficient method. A total of 135 soil samples were collected from the surface of the study area in the southern margin of Lufeng Dinosaur Valley in Yunnan Province. The spectral reflectance and iron oxide content of the samples were measured in the laboratory. The Kennard-Stone algorithm was used to divide the soil samples into a calibration set of 95 samples and validation set of 45 samples. The soil spectral curve was smoothed by Savizky-Golay as the original spectral reflectance (OR), and the original spectrum was transformed using first-order differential reflectance (FD) and reciprocal logarithm reflectance (RL). Based on the correlation analysis of iron oxide content with original spectrum and its transformed spectrum, the iteratively retaining informative variables (IRIV), competitive adaptive reweighted sampling (CARS), and successive projection algorithm (SPA) were used to extract the characteristic wavelength. With the extracted characteristic wavelength as the independent variable and the iron oxide content as the dependent variable, inversion models were constructed by using random forest regression (RF) and partial least squares regression (PLSR). The prediction accuracy of these models was evaluated by comparing the determination coefficient (R2), root mean square error (RMSE), and the ratio of performance to interquartile distance (RPIQ). The results showed that the further use of IRIV, CARS, and SPA algorithms to extract the characteristic wavelength after correlation analysis could effectively reduce the number of modeling variables. For the four characteristic wavelength selection methods, it is necessary to fit with different spectral transformation and modeling methods for improving the prediction accuracy. Comprehensive comparison of all the characteristic wavelength selection methods showed that the PLSR model built using the RL-CC-CARS method achieved the best performance. The R2, RMSE, and RPIQ the validation set were 0.826, 5.600 g∙kg−1, and 3.618, respectively. Based on the 1∶1 scatter plot, the measured soil iron oxide contents and the predicted ones of RL-CC-CARS-PLSR were close to the 1∶1 line and had good prediction results. Appropriate variable selection can improve the performance of the model, simplify the regression model, and improve the accuracy of iron oxide estimation. This study provides a reference for the inversion of soil iron oxide content using hyperspectral data.

     

/

返回文章
返回