基于注意力变形和动态查询机制的交通小目标检测

李建新; 朱进玉; 乔鸿政; 石浩楠

期刊检索

关键词检索

新闻公告MORE

主管单位 中华人民共和国工业和信息化部 主办单位 哈尔滨工业大学主编李隆球 国际刊号ISSN 0367-6234 国内刊号CN 23-1235/T

期刊网站二维码

微信公众号二维码

引用本文:	李建新,朱进玉,乔鸿政,石浩楠.基于注意力变形和动态查询机制的交通小目标检测[J].哈尔滨工业大学学报,2025,57(7):81.DOI:10.11918/202402020
	LI Jianxin,ZHU Jinyu,QIAO Hongzheng,SHI Haonan.Traffic small object detection based on attention deformation and dynamic query mechanism[J].Journal of Harbin Institute of Technology,2025,57(7):81.DOI:10.11918/202402020

【打印本页】【HTML】【下载PDF全文】【查看/发表评论】【下载PDF阅读器】【关闭】

过刊浏览高级检索

本文已被：浏览 2761次下载 342次	码上扫一扫！
分享到：微信更多字体:加大+\|默认\|缩小-
基于注意力变形和动态查询机制的交通小目标检测
李建新^1,2,朱进玉¹,乔鸿政³,石浩楠¹
(1．长安大学电控学院,西安 710064;2．中汽零部件技术(天津)有限公司,天津 300300; 3．厦门大学航空航天学院,福建厦门 361005)

摘要:

深度学习推动了交通目标检测发展,但复杂交通场景下密集遮挡环境中的小目标检测精度仍不足。针对上述问题提出一种注意力变形和动态查询机制的交通小目标检测算法CDAQ-DDETR,在Deformable DETR的基础上,通过引入CBAM注意力双塔机制和DCNv2可变形卷积重构原始残差网络,增强算法对密集区域交通小目标的语义获取能力；借助AFN网络思想添加低层特征,同时构建注意力感知融合金字塔模块,提高算法对多尺度中小交通目标的检测效果；依靠在原解码器前向集成动态查询机制模块结合输入图像匹配目标特性,以构建最佳查询向量提升算法对多样化背景干扰的适应泛化能力。在VisDrone2019数据集上进行实验,结果表明:CDAQ-DDETR算法在平均精确率（mAP@0.5:0.95）上已达到37.9%,在平均召回率（mAR@0.5:0.95）上已达到57.4%,相比现阶段主流SOTA算法在检测精度上提升5.5%,召回率提升8.0%,尤其针对于小目标检测精度提升6.9%,召回率提升了10.0%,同时利用可视化实验分析其更加适用于密集场景下交通小目标检测的实际应用。

关键词: 交通目标检测密集场景小目标检测 Deformable DETR Transformer算法

DOI：10.11918/202402020

分类号:TP391

文献标识码:A

基金项目:国家自然基金重点项目(52232015)；陕西省科技发展计划项目“两链”融合重点专项(2023KXJ-297)

Traffic small object detection based on attention deformation and dynamic query mechanism

LI Jianxin^1,2,ZHU Jinyu¹,QIAO Hongzheng³,SHI Haonan¹

(1.School of Electrical Control, Chang′an University, Xi′an 710064, China; 2.CATARC Component Technology (Tianjin) Co., Ltd., Tianjin 300300, China; 3.School of Aeronautics and Astronautics, Xiamen University, Xiamen 361005, Fujian, China)

Abstract:

While deep learning has advanced traffic object detection, accurately detecting small objects in complex traffic scenes with dense occlusion remains challenging. To address these issues, this paper proposes a novel small traffic object detection algorithm, CDAQ-DDETR, which incorporates an attention-based deformation and dynamic querying mechanism. Building upon Deformable DETR, the algorithm introduces the CBAM attention-based dual-tower mechanism and DCNv2 Deformable convolutions to reconstruct the original residual network, thereby enhancing the semantic acquisition capabilities for small traffic objects in dense areas. By leveraging the AFN network concept to add lower-level features and constructing an attention-aware fusion pyramid module, the algorithm improves detection performance for multi-scale small and medium traffic objects. Additionally, by integrating a dynamic query mechanism module before the original decoder, combined with matching input image characteristics, it constructs optimal query vectors, enhancing the algorithm′s adaptability and generalization ability against diverse background interferences. Experiments conducted on the VisDrone2019 dataset show that the CDAQ-DDETR algorithm has achieved a mean Average Precision (mAP@0.5:0.95) of 37.9% and a mean Average Recall (mAR@0.5:0.95) of 57.4%. Compared to the current state-of-the-art (SOTA) algorithms, there is an improvement of 5.5% in detection precision and 8.0% in recall rate, particularly, an increase of 6.9% in precision and 10.0% in recall rate for detecting small objects. Visualization experiments further demonstrate its practical applicability and superior performance in detecting small traffic objects in dense scenes.

Key words: traffic object detection dense scenes small object detection Deformable DETR Transformer algorithm

期刊检索

关键词检索

新闻公告MORE

友情链接LINKS