多维度注意力与距离比例约束的行人重识别鲁棒方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.41;TP18

基金项目:

国家自然科学(62362063)、新疆维吾尔自治区自然科学基金资助项目(2023D01C21)


Robust Pedestrian Re-identification with Multi-dimensional Attention and Distance Ratio Constraint
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对行人重识别(Re-ID)任务中视角变化、遮挡及背景干扰导致特征判别性不足、训练稳定性差及复杂场景鲁棒性弱的问题,本文提出一种增强鲁棒性的多策略行人重识别网络。首先,针对特征聚焦能力不足与背景噪声干扰问题,在ResNet50主干网络的四个阶段嵌入Triplet Attention模块,通过通道、水平和垂直三向注意力机制动态调整特征权重,增强对行人关键区域的聚焦能力并抑制背景噪声。其次,针对传统Triplet Loss因忽略正负样本距离绝对比例而导致的特征空间分布冗余问题,在传统Triplet Loss基础上引入正负样本距离比例约束项,通过精细化调控特征空间分布,提升类内紧凑性与类间区分性。此外,针对网络训练初期的梯度不稳定问题,采用Kaiming初始化与归一化层优化策略,加速收敛并增强特征表征能力。实验表明,该方法在Market1501、DukeMTMC-ReID和MSMT17_V2数据集上的mAP分别达到88.4%、78.9%和54.6%,Rank-1指标分别为95.3%、88.9%和78.1%,均优于现有主流方法。热力图与排序图可视化进一步验证了模型对行人细节的精准捕捉及复杂场景的强适应性。本研究为提升跨摄像头行人匹配的鲁棒性提供了有效解决方案。

    Abstract:

    In pedestrian re-identification (Re-ID) tasks, insufficient feature discriminability, poor training stability, and weak robustness are often caused by viewpoint changes, occlusion, and background interference. A multi-strategy pedestrian re-identification network with enhanced robustness is proposed.Triplet Attention modules were embedded in the four stages of the ResNet50 backbone network. Channel, horizontal, and vertical attention mechanisms were used. Feature weights were dynamically adjusted. The focusing ability on key pedestrian areas was enhanced. Background noise was suppressed.A constraint term on the ratio of positive and negative sample distances was introduced on the basis of traditional Triplet Loss. The feature space distribution was finely regulated. Intra-class compactness was improved. Inter-class discriminability was enhanced.Kaiming initialization and normalization layer optimization strategies were adopted. Gradient instability in the early training stage was reduced. Network convergence was accelerated. Feature representation ability was enhanced.The mAP of this method is 88.4%, 78.9%, and 54.6% on the Market1501, DukeMTMC-ReID, and MSMT17_V2 datasets. The Rank-1 index is 95.3%, 88.9%, and 78.1%. These results are superior to those of existing mainstream methods.The visualization of heat maps and ranking charts shows that pedestrian details are accurately captured. Strong adaptability to complex scenes is achieved. This method provides an effective solution to improve the robustness of cross-camera pedestrian matching.

    参考文献
    相似文献
    引证文献
引用本文

刘梓良,伊力哈木·亚尔买买提. 多维度注意力与距离比例约束的行人重识别鲁棒方法[J]. 科学技术与工程, 2026, 26(13): 5616-5627.
Liu Ziliang, Yilihamu Yaermaimaiti. Robust Pedestrian Re-identification with Multi-dimensional Attention and Distance Ratio Constraint[J]. Science Technology and Engineering,2026,26(13):5616-5627.

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-05-19
  • 最后修改日期:2026-05-15
  • 录用日期:2025-11-19
  • 在线发布日期: 2026-05-18
  • 出版日期:
×
2026年会通知 | “技术经济学驱动智能经济生态构建与治理变革”——中国技术经济学会第三十三届学术年会(2026)会议通知暨征文启事(第一轮)
亟待确认版面费归属稿件,敬请作者关注