现代制造工程 ›› 2024, Vol. 528 ›› Issue (9): 127-135.doi: 10.16731/j.cnki.1671-3133.2024.09.017

• 仪器仪表/检测/监控 • 上一篇    下一篇

基于机器视觉的高铁碳滑板图像分割算法研究*

刘伟民1, 张少宁1, 郑爱云1, 刘晋2, 郑直1   

  1. 1 华北理工大学机械工程学院,唐山 063210;
    2 中车唐山机车车辆有限公司,唐山 064000
  • 收稿日期:2024-01-03 出版日期:2024-09-18 发布日期:2024-09-27
  • 通讯作者: 郑爱云,硕士,副教授,主要研究方向为工业物联网。 E-mail:zay@ncst.edu.cn
  • 作者简介:刘伟民,博士,副教授,主要研究方向为工业物联网。E-mail:lzhjia@ncst.edu.cn; 张少宁,硕士研究生,主要研究方向为高铁智能运维。E-mail:13472050594@163.com; 刘晋,学士,正高级工程师,主要研究方向为轨道交通装备研发。 郑直,博士/博士后,副教授,主要研究方向为智能故障诊断。
  • 基金资助:
    *河北省科技重大专项项目(22282203Z);河北省自然科学基金资助项目(E2022209086)

Research on image segmentation algorithm of high-speed rail carbon skateboard based on machine vision

LIU Weimin1, ZHANG Shaoning1, ZHENG Aiyun1, LIU Jin2, ZHENG Zhi1   

  1. 1 College of Mechanical Engineering,North China University of Science and Technology, Tangshan 063210,China;
    2 CRRC Tangshan Co.,Ltd.,Tangshan 064000,China
  • Received:2024-01-03 Online:2024-09-18 Published:2024-09-27

摘要: 针对语义分割模型识别碳滑板边缘困难、复杂背景干扰性较大以及特征信息丢失严重等问题,提出一种新型编解码结构的Swin Transformer语义分割优化算法。首先,主干网络采用U型的编解码结构,实现多尺度的信息融合;其次,添加注意力局部增强感知模块来扩大感受野并提高模型泛化能力;然后,采用具有数据相关性的上采样结构,以提高上采样质量,摆脱分辨率对预测结果的影响,加强图像重建能力;最后,将跳跃连接更换为残差路径,使编解码结构中的语义信息联系更加紧密,提升训练效率。实验结果表明,Swin Transformer语义分割优化算法相较基线算法测量预测精度提高了3.63 %,所有类别中的像素分类正确率的平均值提高了7.29 %。研究结果综合验证了新型编解码结构的Swin Transformer语义分割模型在识别和处理碳滑板任务中的优越性及鲁棒性。

关键词: 碳滑板, Swin Transformer, 局部增强感知, 图像重建, 语义分割

Abstract: A novel Swin Transformer semantic segmentation optimization algorithm with a codec structure is proposed to solve the problems such as difficulty in identifying the edge of carbon sliding plate by semantic segmentation model,large interference in complex background,and serious feature information loss. Firstly,the backbone network adopts U-shaped codec structure to realize multi-scale information fusion.Secondly,the attention local enhancement module is added to expand the sensing field and improve the model generalization ability. Then,the upsampling structure with data correlation is used to enhance the quality of upsampling,eliminate the impact of resolution on prediction results,and improve image reconstruction capability. Finally,the skip connection is replaced by the residual path to make the semantic information in the codec structure more closely connected and improve the training efficiency. The experimental results show that the Swin Transformer semantic segmentation algorithm improves the measurement prediction accuracy by 3.63 %,and the average accuracy of pixel classification in all categories is improved by 7.29 %. The research results confirm the superiority and robustness of the Swin Transformer semantic segmentation model in identifying and handling carbon slide tasks.

Key words: carbon sliding plate, Swin Transformer, local enhanced sensing, image reconstruction, semantic segmentation

中图分类号: 


版权所有 © 《现代制造工程》编辑部 
地址:北京市东城区东四块玉南街28号 邮编:100061 电话:010-67126028 电子信箱:2645173083@qq.com
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn
访问总数:,当日访问:,当前在线: