• 查询稿件
  • 获取最新论文
  • 知晓行业信息

基于层次化主题分析的铁路敏感数据智能识别与分类分级方法

Intelligent identification and classification and grading method of railway sensitive data based on hierarchical topic analysis

  • 摘要: 为了对铁路不同敏感等级数据实施差异化保护,文章提出了一种基于层次化主题分析的铁路敏感数据智能识别与分类分级方法,旨在为铁路网络数据分级保护提供依据。该方法利用数据语义和分类分级规则建立主题词库,通过主题分析初步判断数据敏感级别。考虑到铁路网络数据的敏感级别分布不平衡,设计分级概率向量加权聚合机制,利用凝聚层次聚类算法实现准确定级。经实验验证,与基于语义和K-means聚类的传统主题分析方法相比,该方法可有效缓解分布不平衡问题,实现细粒度、动态可调整的铁路敏感数据智能识别与准确定级,从而为落实铁路网络数据分级管理要求、确保铁路网络数据安全可控提供技术支撑。

     

    Abstract: In order to implement differentiated protection for railway data with different sensitivity levels, this paper proposed an intelligent recognition and classification and grading method for railway sensitive data based on hierarchical topic analysis, aimed to provide a basis for the grading protection of railway network data. This method utilized data semantics and classification and grading rules to establish a topic lexicon, and preliminarily determined the sensitivity level of data through topic analysis. Considering the uneven distribution of sensitivity levels in railway network data, the paper designed a graded probability vector weighted aggregation mechanism and used the cohesive hierarchical clustering algorithm to implement accurate grading. Through experimental verification, compared with traditional topic analysis methods based on semantics and K-means clustering, this method can effectively alleviate the problem of imbalanced distribution, implement fine-grained, dynamically adjustable intelligent recognition and accurate grading of railway sensitive data, and provide technical support for implementing the requirements of railway network data grading management and ensuring the security and controllability of railway network data.

     

/

返回文章
返回