• 查询稿件
  • 获取最新论文
  • 知晓行业信息

基于多模态图文检索的铁路遗失物品查找方法研究

Railway lost items retrieval method based on multimodal image-text retrieval

  • 摘要: 为实现铁路遗失物品文字信息与图像信息的精确匹配,提高铁路遗失物品的找回率,实现铁路客运领域降本增效,提升铁路客运服务质量与管理水平,提出一种铁路遗失物品查找方法。该方法先结合遗失物品的图像和文本描述,通过信息抽取技术提取物品的关键特征;再利用Chinese-CLIP(Contrastive Language-Image Pretraining)多模态图文检索技术,将图像信息和文本信息映射到统一的语义空间,并使用真实数据进行训练,实现图像与文本间的精准匹配,从海量的遗失物品登记信息中高效地筛选出与遗失物品最相关的结果。实验结果表明,在遗失物品查找任务中,该检索方法相较于传统方法优势显著,能够有效提升检索准确率与响应速度,从而更好地满足旅客和管理人员对高效、便捷查找物品的需求。

     

    Abstract: To implement precise matching between textual and image information of lost railway items, improve their recovery rate, realize cost reduction and efficiency improvement in railway passenger transportation, and enhance the quality and management level of its services, this paper proposed a retrieval method for lost railway items. The method first combined the images and textual descriptions of lost items, extracted their key features via information extraction technology, and subsequently employed Chinese-CLIP (Contrastive Language-Image Pretraining) multimodal image-text retrieval technology to map both visual and textual information into a unified semantic space. Trained on real-world data, the method efficiently filtered out the most relevant results from a large volume of lost item registration records, achieving accurate cross-modal matching. Experimental results show that in the lost item retrieval task, this method outperforms traditional approaches significantly, effectively improving retrieval accuracy and response speed, and thus better meeting the needs of passengers and management personnel for efficient and convenient retrieval of lost items.

     

/

返回文章
返回