基于双采样随机森林的临滑阶段的预测算法:以湖北黄石某治理地块为例
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

P642.22;P694

基金项目:

(JELRGBDT202206);江西省自然科学(20212BAB203004) 江西省防震减灾与工程地质灾害探测工程研究中心开放(SDGD202005)


Prediction algorithm of pre-slip stage based on double sampling random forest
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对碎石土边坡监测过程中滑坡稳定变形期与临滑阶段监测数据量严重不匹配,导致临滑阶段数据量偏小,从而产生的非平衡数据集造成预判不准确的问题,提出一种基于DST随机森林的碎石土边坡临滑阶段地表位移的预测算法。首先,采用过采样和欠采样相结合的双采样技术(DST,DoubleSamplingTechnique)对地表位移中的非平衡数据集进行采集,然后,通过随机森林预测算法有放回的随机抽样进行预测,最后,通过实验得出预测结果。结果表明:DST随机森林预测算法相比于普通随机森林预测算法预测误差率降低到3.39%,证明双采样技术(DST)采集临滑阶段非平衡数据集的必要性。

    Abstract:

    In the monitoring process of gravel soil slope,there is a serious mismatch between the monitoring data of the landslide stable deformation stage and the pre-sliding stage,which leads to the small amount of data in the pre-sliding stage,and the unbalanced data set resulting in the inaccurate prediction.In this paper,a prediction algorithm of surface displacement in the pre-slip stage of gravel soil slope based on DST random forest is proposed.Firstly,the threshold comparison method of surface displacement pre-slip is used to determine whether there is an unbalanced data set.If there is an unbalanced data set,the double sampling technology (DST) is used to collect the unbalanced data set,and the negative samples are oversampled to improve the proportion of negative sample data set.Undersampling of positive sample data is carried out to reduce the proportion of positive sample data set.Equal amount of random positive and negative samples are selected as the training set to be processed,and the random forest algorithm is built to test the processed training set,and the test set after training is compared with the predicted results before the collection of non-equilibrium data set.Finally,by comparing the error values and error rate before and after sampling,it is verified that the prediction error rate of DST random forest prediction algorithm is reduced to 3.39% compared with the ordinary random forest prediction algorithm (the prediction error rate is 4.66%),which proves the necessity of double sampling technology (DST) to collect non-equilibrium data set in the pre-slip stage. Finally, it is concluded that DST random forest algorithm can obviously improve the pre-warning effect of the pre-slip stage, and overcome the problems of voting average, stagnation and increasing error rate.

    参考文献
    相似文献
    引证文献
引用本文

郭明娟,徐哈宁,肖慧,等. 基于双采样随机森林的临滑阶段的预测算法:以湖北黄石某治理地块为例[J]. 科学技术与工程, 2024, 24(14): 5733-5741.
Guo Mingjuan, Xu Haning, Xiao Hui, et al. Prediction algorithm of pre-slip stage based on double sampling random forest[J]. Science Technology and Engineering,2024,24(14):5733-5741.

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-07-12
  • 最后修改日期:2024-05-08
  • 录用日期:2023-10-06
  • 在线发布日期: 2024-05-30
  • 出版日期:
×
《科学技术与工程》诚挚邀请审稿专家