Abstract:
Timely and accurate prediction of water eutrophication is of great significance to lake management, but the traditional early warning model of water eutrophication is difficult to meet the demand for accurate monitoring of the eutrophication status of water bodies, and the prediction accuracy still needs to be improved. We collected the water quality data of Ulansuhai Lake from 2011 to 2020, including water temperature, chemical oxygen demand, total nitrogen, total phosphorus, chlorophyll-a concentration, and other key indicators. Chlorophyll a was used to characterize the eutrophication of the water body, and the temporal and spatial patterns of chlorophyll a concentration were explored. A model integrating Spearman's correlation analysis and random forest was constructed to predict the eutrophication status of water bodies in Ulansuhai Lake. The accuracy and stability of the model were verified by using various assessment indexes such as accuracy, recall rate, F1 value, ROC curve, and confusion matrix. The results showed that chlorophyll a concentration in Ulansuhai Lake fluctuated significantly from 2011 to 2020, and the annual average value showed a trend of "decreasing-rising-decreasing". The accuracy of the model in the training set was 95.89%, and it was able to accurately distinguish between eutrophication and non-eutrophication states (category 1 and category 0). A total of 80 category 0 judgments and 16 category 1 warnings occurred in the test set, of which 90 matched the actual water body status, with a test accuracy of 93.75%, a recall rate of 85.71%, and an F1 value of 0.8, which proved the model's ability to provide early warnings in real-world scenarios, and the confusion matrix and ROC curve analyses further verified the model's efficiency and reliability. The model was also tuned with hyperparameters during the training of the training set, resulting in an average cross-validation accuracy of 91.32%. The feature importance analysis showed that the water quality indicators, such as ammonia nitrogen and total phosphorus, played a significant role in the prediction of the eutrophication status of water bodies. This study provides an effective early warning tool for accurate monitoring of water body eutrophication and lake management, and has practical application value.