Abstract:In the monitoring process of gravel soil slope,there is a serious mismatch between the monitoring data of the landslide stable deformation stage and the pre-sliding stage,which leads to the small amount of data in the pre-sliding stage,and the unbalanced data set resulting in the inaccurate prediction.In this paper,a prediction algorithm of surface displacement in the pre-slip stage of gravel soil slope based on DST random forest is proposed.Firstly,the threshold comparison method of surface displacement pre-slip is used to determine whether there is an unbalanced data set.If there is an unbalanced data set,the double sampling technology (DST) is used to collect the unbalanced data set,and the negative samples are oversampled to improve the proportion of negative sample data set.Undersampling of positive sample data is carried out to reduce the proportion of positive sample data set.Equal amount of random positive and negative samples are selected as the training set to be processed,and the random forest algorithm is built to test the processed training set,and the test set after training is compared with the predicted results before the collection of non-equilibrium data set.Finally,by comparing the error values and error rate before and after sampling,it is verified that the prediction error rate of DST random forest prediction algorithm is reduced to 3.39% compared with the ordinary random forest prediction algorithm (the prediction error rate is 4.66%),which proves the necessity of double sampling technology (DST) to collect non-equilibrium data set in the pre-slip stage. Finally, it is concluded that DST random forest algorithm can obviously improve the pre-warning effect of the pre-slip stage, and overcome the problems of voting average, stagnation and increasing error rate.