Abstract:General aviation flight training exceedance event records are characterized by instantaneous discreteness, data scarcity, and missing features. Associated risk patterns are difficult to be captured by traditional single-factor or continuous-data-based analysis methods. A risk analysis method integrating data augmentation, deep clustering, and association mining is proposed in this paper. Flight feature snapshots were augmented with high fidelity by utilizing the WGAN-GP model. Flight parameter intervals were delineated by introducing the VBGMM. Latent associations among combinations of parameter intervals were mined by applying the FP-Growth algorithm. A total of 259 strong association rules are mined in this study. Four types of core risk patterns, including the speed-attitude bimodal risk of hard landing events and the rate-of-change mismatch of bank angle warning events, are identified with the assistance of knowledge graphs. Specific indicated airspeed intervals are revealed by the cross-event risk network as common risk factors connecting multiple types of exceedance events. Ideas for solving risk analysis problems of similar numerical discrete data are provided by this study. Parameter interval association rules are transformed into topological networks. Theoretical basis and decision support are provided for optimizing the early warning logic of flight quality monitoring, implementing targeted flight training, and improving the efficiency of the risk control system.