基于横向联邦学习的风险用户识别研究

樊春美; 杨立鹏; 李雯; 张智

doi:10.3969/j.issn.1005-8451.2024.08.12

基于横向联邦学习的风险用户识别研究

Risk user identification based on horizontal federated learning

摘要

摘要: 第三方平台推出的各种铁路旅客抢票服务，给中国铁路12306互联网售票系统（简称：12306）带来了较大压力，为保障12306的稳定性和旅客购票的公平性，亟需对风险用户进行识别。为应对因12306部署在不同的物理位置、不同中心的数据聚合存在一定风险的情况，研究在用户数据分散的条件下，基于横向联邦学习的风险用户识别方法。文章基于用户的访问行为，构建和提取用户特征，构建基于XGboost、逻辑回归和神经网络等算法的横向联邦学习模型，并进行模型验证。实验结果表明，基于XGboost算法的横向联邦学习模型具有较好的风险用户识别效果，为铁路数据的安全使用提供了技术支撑。

Abstract: Various railway passenger ticket grabbing services launched by the third-party platform have brought great pressure to the China railway 12306 Internet ticketing and reservation system (12306 for short). In order to ensure the stability of 12306 and the fairness of passenger ticket purchase, it is urgent to identify risk users. This paper aimed to address the risk of data aggregation caused by the deployment of 12306 in different physical locations and centers, studied a risk user identification method based on horizontal federated learning under the condition of dispersed user data. Based on user access behavior, the paper constructed and extracted user features, constructed a horizontal federated learning model using algorithms such as XGboost, logistic regression, and neural networks, and validated the model. The experimental results show that the horizontal federated learning model based on XGboost algorithm has good risk user recognition performance, provides technical support for the safe use of railway data.

HTML全文

参考文献(19)

施引文献

资源附件(0)