• 查询稿件
  • 获取最新论文
  • 知晓行业信息

基于GMM聚类的铁路网络数据风险等级分类方法

Risk level classification method for railway network data based on GMM clustering

  • 摘要: 铁路行业信息基础设施及重要信息系统产生的数据种类繁多、数量庞大且价值密度高,而不同类型或等级的铁路网络数据存在不同级别的安全风险。为了完善铁路网络数据风险评估机制,设计一种基于高斯混合模型(GMM,Gaussian Mixture Model)聚类的铁路网络数据风险等级分类方法。从数据和风险角度提取关键信息,构建风险信息数据集;通过K-means聚类获得初始聚类中心;基于混合距离计算进行GMM聚类,实现数据风险等级划分。经实验验证,与传统K-means聚类、谱聚类算法相比,GMM聚类算法对铁路网络数据的聚类效果更优,能够更加准确地对铁路网络数据进行风险等级分类,从而为进一步落实铁路网络数据安全管理要求提供重要的技术支撑。

     

    Abstract: The information infrastructure and important information systems in the railway industry generate a wide variety of data types, large quantities, and high value density, and different types or levels of railway network data have different levels of security risks. In order to improve the risk assessment mechanism for railway network data, this paper designed a risk level classification method for railway network data based on GMM clustering. The paper extracted key information from the perspectives of data and risk, and constructs a risk information dataset, obtained initial cluster centers through K-means clustering, performed GMM clustering based on mixed distance calculation, and implemented data risk level classification. Through experimental verification, compared with traditional K-means clustering and spectral clustering algorithms, the GMM clustering algorithm has a better clustering effect on railway network data and can more accurately classify the risk level of railway network data, which provide important technical support for further implementing the requirements of railway network data security management.

     

/

返回文章
返回