面向建设期铁路大数据的分级存储方法研究

廉小亲; 杨凯; 程智博; 王万齐; 吴艳华

doi:10.3969/j.issn.1005-8451.2022.02.04

面向建设期铁路大数据的分级存储方法研究

Railway big data hierarchical storage method oriented to construction period

摘要

摘要: 我国铁路网包含众多建设期和运营期路段，均会产生大量业务数据，然而传统的单节点大数据存储方式存在访问速度慢和时效性低等局限性，无法有效缓解数据存储压力。文章基于数据分级存储的思想，设计一种分布式大数据分级存储架构；综合考虑建设期铁路大数据的业务属性和存储数据库的固有属性，建立一套数据价值评价体系；基于专家评价法计算各数据表在不同评价维度下的价值，并通过K-means聚类算法判定各数据表相应的存储级别；以某建设期铁路大数据为实验样本进行验证，实验结果表明，文章提出的价值评价体系能够有效地对铁路建设期大数据进行存储级别判定，实现了面向建设期铁路大数据的分级存储。

Abstract: China's railway network contains many railway sections of construction periods and operation periods, which produce a large number of business data. However, the traditional single-node big data storage method has limitations such as slow access speed and low timeliness, which cannot effectively alleviate the pressure of data storage. Based on the idea of data hierarchical storage, this paper designed a distributed hierarchical storage architecture of big data, comprehensively considered the business attributes of railway big data in the construction period and the inherent attributes of storage database, and established a set of data value evaluation system, calculated the value of each data table under different evaluation dimensions based on expert evaluation method, and determined the corresponding storage level of each data table through K-means clustering algorithm. The paper took the railway big data in a construction period as the experimental sample for verification. The experimental results show that the value evaluation system proposed in this paper can effectively judge the storage level of railway big data in the construction period, and implement the hierarchical storage of railway big data oriented to the construction period.

HTML全文

参考文献(18)

施引文献

资源附件(0)