基于机器学习的江汉平原土壤多功能评估与空间分异研究

A study on the multifunctional assessment and spatial differentiation of soil in the Jianghan Plain based on machine learning

  • 摘要: 土壤多功能性评估对区域生态安全和可持续发展具有重要意义,为构建江汉平原土壤多功能评估体系并揭示其空间分异规律,采用主成分分析从27个土壤参数中筛选出14个核心指标构建最小数据集,基于初级生产力、水质净化与调节、气候调节与碳封存、生物多样性维持及养分供应与循环5项土壤功能,运用随机森林、支持向量机、XGBoost、神经网络和梯度提升机5种机器学习算法构建预测模型,采用Getis-Ord Gi*方法分析土壤多功能性空间分布特征。结果表明:最小数据集实现48.15%降维率,与全量数据集相关性达0.816(R2=0.667),在保持91.72%原始信息的前提下显著提升评估效率。不同机器学习算法在各土壤功能预测中表现出差异化优势,XGBoost在初级生产力(R2=0.979 7)和生物多样性维持(R2=0.872 2)预测中表现最优,SVM在水质净化调节(R2=0.939 2)和养分供应循环(R2=0.802 5)预测中效果最佳,神经网络在气候调节与碳封存(R2=0.959 9)预测中性能突出。土壤功能空间分布呈“北低南高、中部适中”格局,土壤有机碳密度和黏土含量为主要驱动因子,多功能热点区域仅占1.03%,边缘显著区域达25.71%,土壤功能间存在显著权衡关系。研究建立了基于机器学习的土壤多功能评估技术体系,可为江汉平原土壤功能精准管理和生态保护策略制定提供科学依据。

     

    Abstract: The assessment of soil multifunctionality is crucial for regional ecological security and sustainable development. This study aimed to construct a comprehensive soil multifunctionality assessment framework for the Jianghan Plain and elucidate its spatial distribution patterns. Principal component analysis was employed to screen 14 core indicators from 27 soil parameters, establishing a Minimum Data Set (MDS). Based on five key soil functions (primary productivity, water purification and regulation, climate regulation and carbon sequestration, biodiversity maintenance, and nutrient supply and cycling), predictive models were developed using five machine learning algorithms: Random Forest, Support Vector Machine (SVM), XGBoost, Neural Network, and Gradient Boosting Machine. Spatial distribution characteristics of soil multifunctionality were analyzed using the Getis-Ord Gi* method. The results revealed that: (1) The MDS achieved a 48.15% dimensionality reduction rate with a correlation coefficient of 0.816 (R2= 0.667) compared to the full dataset, retaining 91.72% of the original information while significantly improving assessment efficiency. (2) Different machine learning algorithms demonstrated varying advantages in predicting specific soil functions. XGBoost performed optimally for primary productivity (R2= 0.9797) and biodiversity maintenance (R2= 0.8722) predictions, while SVM excelled in water purification and regulation (R2= 0.9392) and nutrient supply and cycling (R2= 0.8025) predictions. Neural Network showed superior performance in climate regulation and carbon sequestration (R2 = 0.9599) prediction. (3) Soil function spatial distribution exhibited a distinctive pattern characterized as "low in the north, high in the south, and moderate in the central region." Soil organic carbon density and clay content emerged as the primary driving factors. Multifunctional hotspots accounted for only 1.03% of the study area, while edge-significant regions comprised 25.71%, indicating substantial trade-offs among soil functions. This research establishes a machine learning-based technical framework for soil multifunctionality assessment, thereby providing scientific support for precision soil function management and ecological conservation strategy development in the Jianghan Plain.

     

/

返回文章
返回