高级检索

    基于融合多模态目标识别算法的运煤火车外观质量状态多任务检测方法

    Multi-Task Detection Method for Coal-Carrying Train Appearance Quality Status Based on Integrated Multi-Modal Object Recognition Algorithms

    • 摘要: 传统目标检测方法在应对复杂运煤火车裂纹、缺陷及部件缺失等多类质量检测场景时,面临多缺陷目标混杂、样本分布不均衡、微小目标检测困难等挑战,导致现有铁路货车装载监测系统难以实现精准高效的外观质量评估。为此,本研究提出了一种融合多模态视觉目标识别算法的新型检测方法。首先,以YOLOX-s为基本框架,提出了轻量化目标检测网络,提升目标识别效率;其次,建立了多特征融合的路径聚合特征金字塔(PAFPN)网路模块,实现对多特征任务的同步识别;最后,优化设计了带平衡参数的改进交并比损失函数,提升了目标识别算法精度。实验表明,改进后的目标检测算法推理效率较基线模型提升91%,平均精度均值(mAP)提高5%。进一步的,通过融合目标检测、图像分类与文本识别算法,构建多模态协同检测网络框架,使系统整体识别精度提升13%,处理耗时仅增加7 ms。本研究提出的方法有效解决了运煤车厢多特征状态同步识别难题,其检测效率与精度的协同优化机制,为大型运动设备外观质量监测提供了创新性解决方案,具有显著的应用价值。

       

      Abstract: Traditional object detection methods face challenges such as multi-defect target interference, imbalanced sample distribution, and difficulties in detecting small targets when applied to complex quality inspection scenarios of coal-carrying trains, including crack detection, defect identification, and component absence verification. These limitations result in insufficient precision and efficiency for current railway freight loading monitoring systems in conducting comprehensive appearance quality assessments. To address these issues, this study proposes a novel detection method integrating multi-modal visual algorithms. First, a lightweight object detection network based on the YOLOX-s framework is developed to enhance recognition efficiency. Second, a path-aggregated feature pyramid network module with multi-feature fusion is established to enable synchronous identification of multi-feature tasks. Finally, an improved Intersection over Union (IoU) loss function with balanced parameters is designed to optimize algorithm accuracy. Experimental results demonstrate that the enhanced detection algorithm achieves 91% higher inference efficiency than the baseline model while improving mean Average Precision (mAP) by 5%. Furthermore, by integrating object detection, image classification, and text recognition algorithms into a multi-modal collaborative detection framework, the system's overall recognition accuracy improves by 13% with only a 7 ms increase in processing time. The proposed methodology effectively resolves the challenges of synchronous multi-feature state recognition for coal wagons, and its coordinated optimization mechanism between detection efficiency and accuracy provides an innovative solution for appearance quality monitoring of large moving equipment, demonstrating significant application value.

       

    /

    返回文章
    返回