Improvement of soil organic carbon density prediction and mapping based on human activity factors and random forest model

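  • Summary: Soil organic carbon (SOC) density is an important soil property that affects food security and agricultural decision-making. Previous studies of SOC density prediction have mostly relied on natural environmental variables. However, in regions with intensive agricultural production, human activities also influence soil properties to some extent. Taking the Huang-Huai-Hai Plain as the study area, this study selected 24 environmental variables and four human activity variables (population density, built-up volume, road network density, and anthropogenic heat flux) to examine the importance of human activities for predicting the SOC density of cultivated land. The results show that environmental covariates alone explained only 35% of the variation in cultivated-land SOC density. After the human activity variables were added, the coefficient of determination (R²) and Lin’s concordance correlation coefficient (LCCC) increased by 37.14% and 19.67%, respectively, while the mean absolute error (MAE) and root mean square error (RMSE) decreased by 8.47% and 9.88%, respectively, indicating better model performance and prediction accuracy. This suggests that human activities have an important influence on the spatial differentiation of SOC density in the Huang-Huai-Hai Plain. Variable importance analysis found that the maximum daytime land surface temperature was the most important predictor, followed by the standard deviation of daytime land surface temperature. Among the human activity variables, anthropogenic heat flux was the most important predictor, with an importance share of 8.25%, followed by population density, built-up volume, and road network density.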
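The four evaluation metrics named above, and the relative changes reported between models, can be sketched in plain Python. This is a minimal illustration using the standard textbook definitions (R² as 1 − SS_res/SS_tot, LCCC with population variances); the function names are hypothetical, and the study's exact formulas may differ in detail.

```python
import math

def mae(obs, pred):
    """Mean absolute error."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)

def rmse(obs, pred):
    """Root mean square error."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def r2(obs, pred):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot

def lccc(obs, pred):
    """Lin's concordance correlation coefficient (population variances)."""
    n = len(obs)
    mean_o = sum(obs) / n
    mean_p = sum(pred) / n
    var_o = sum((o - mean_o) ** 2 for o in obs) / n
    var_p = sum((p - mean_p) ** 2 for p in pred) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred)) / n
    return 2.0 * cov / (var_o + var_p + (mean_o - mean_p) ** 2)

def relative_change(baseline, new):
    """Percentage change of a metric relative to the baseline model."""
    return (new - baseline) / baseline * 100.0

# A 37.14% relative increase from a baseline R² of 0.35 gives roughly 0.48.
r2_integrated = 0.35 * (1 + 37.14 / 100)
```

Unlike Pearson correlation, LCCC penalizes systematic offset between predictions and observations through the squared mean difference in its denominator, which is why it is often reported alongside R² in soil mapping.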

     

    Abstract: Soil organic carbon (SOC) density is a critical soil attribute that not only sustains soil fertility and regulates terrestrial carbon cycles but also influences regional food security and agricultural management decisions. Previous research on SOC density prediction has relied primarily on natural environmental variables, such as climate, topography, and vegetation, and has often overlooked the role of anthropogenic disturbances, especially in heavily human-impacted areas where human activities significantly alter soil properties. This study focuses on the Huang-Huai-Hai Plain (HHH Plain), a major grain-producing region in China characterized by intensive agriculture and urbanization. To improve SOC density prediction accuracy for cultivated land, a random forest (RF) algorithm was used to integrate 24 environmental covariates (climate, soil parent materials, topography, vegetation, land surface thermal conditions, and soil properties) with four human activity variables (population density, built-up volume, road network density, and hourly anthropogenic heat flux). Model parameters were optimized using five-fold cross-validation (n_estimators=100 and max_depth=4), and performance was assessed via mean absolute error (MAE), root mean square error (RMSE), coefficient of determination (R²), and Lin’s concordance correlation coefficient (LCCC). The baseline model using only environmental covariates (Model 1) explained 35% of SOC density variation. In contrast, the integrated model (Model 6), which incorporated the environmental covariates and all four human activity variables, improved prediction performance: R² and LCCC increased by 37.14% and 19.67%, respectively, while MAE and RMSE decreased by 8.47% and 9.88%, respectively. This model accounted for 48% of SOC density variation, highlighting the indispensable role of anthropogenic factors. 
Among all predictors, daytime land surface temperature was the most influential environmental factor, while hourly anthropogenic heat flux emerged as the most critical human activity factor, contributing 8.25% to prediction importance, surpassing population density (2.90%), built-up volume (0.58%), and road network density (0.06%). These findings demonstrate that integrating human activity factors, particularly hourly anthropogenic heat flux, is essential for accurate SOC density predictions in the HHH Plain. This study provides a scientific basis for regional soil carbon management, sustainable agricultural development, and ecological protection, while offering a reference for similar studies in human-dominated agricultural regions globally.
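The workflow described in the abstract (an RF regressor tuned by five-fold cross-validation, fitted once on environmental covariates only and once with the human activity variables added, followed by a variable importance breakdown) might be sketched as follows with scikit-learn. Everything here is an assumption for illustration: the data are synthetic random numbers, the parameter grid is hypothetical (it merely contains the reported values n_estimators=100 and max_depth=4), and the study's actual software and settings are not stated in the abstract.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, KFold

rng = np.random.default_rng(0)
n_samples = 300

# Synthetic stand-ins: 24 environmental covariates and 4 human activity variables.
X_env = rng.normal(size=(n_samples, 24))
X_human = rng.normal(size=(n_samples, 4))
# Synthetic target that depends on one covariate from each group plus noise.
y = X_env[:, 0] + 0.5 * X_human[:, 3] + rng.normal(scale=0.5, size=n_samples)

X_all = np.hstack([X_env, X_human])

# Hypothetical search grid; includes the values reported in the abstract.
param_grid = {"n_estimators": [50, 100], "max_depth": [2, 4]}
cv = KFold(n_splits=5, shuffle=True, random_state=0)

def fit_rf(X, y):
    """Tune a random forest with five-fold cross-validation, scored by R²."""
    search = GridSearchCV(
        RandomForestRegressor(random_state=0),
        param_grid,
        cv=cv,
        scoring="r2",
    )
    search.fit(X, y)
    return search

baseline = fit_rf(X_env, y)     # environmental covariates only
integrated = fit_rf(X_all, y)   # environmental + human activity variables

# Impurity-based importances sum to 1; express them as percentage shares.
importance_share = 100 * integrated.best_estimator_.feature_importances_
human_share = importance_share[24:]  # shares of the 4 human activity columns
```

Comparing `baseline.best_score_` with `integrated.best_score_` gives the cross-validated R² of the two feature sets, and `human_share` corresponds to the kind of per-variable importance percentages quoted above. Impurity-based importance is only one possible measure; the abstract does not say which one the study used.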

     
