泰勒展开与复合注意力引导的红外与可见光图像融合

杨艳春; 李毅

期刊检索

关键词检索

新闻公告MORE

主管单位 中华人民共和国工业和信息化部 主办单位 哈尔滨工业大学主编李隆球 国际刊号ISSN 0367-6234 国内刊号CN 23-1235/T

期刊网站二维码

微信公众号二维码

引用本文:	杨艳春,李毅.泰勒展开与复合注意力引导的红外与可见光图像融合[J].哈尔滨工业大学学报,2026,58(5):54.DOI:10.11918/202509025
	YANG Yanchun,LI Yi.Infrared and visible image fusion guided by Taylor expansion and composite attention[J].Journal of Harbin Institute of Technology,2026,58(5):54.DOI:10.11918/202509025

【打印本页】【HTML】【下载PDF全文】【查看/发表评论】【下载PDF阅读器】【关闭】

过刊浏览高级检索

本文已被：浏览 900次下载 27次	码上扫一扫！
分享到：微信更多字体:加大+\|默认\|缩小-
泰勒展开与复合注意力引导的红外与可见光图像融合
杨艳春,李毅
(兰州交通大学电子与信息工程学院,兰州 730070)

摘要:

为解决深度学习融合算法中存在的忽略像素间相关性,导致融合结果丢失重要全局纹理,以及难以平衡目标突出与场景增强的问题,本文提出了一种泰勒展开与复合注意力机制引导的红外与可见光图像融合算法。首先,设计了一种泰勒展开网络,将输入图像分解为映射层与导数层,从而实现对图像多层次特征信息的有效提取；其次,采用双分支特征提取网络,其中平行卷积网络负责捕获局部细节特征,SwinTransformer模块则专注于提取全局上下文信息,确保局部与全局特征的高效保留；再次,引入复合注意力机制来进一步提升特征融合的精度,该机制通过轴向注意力融合空间维度特征,同时利用通道注意力强化通道间的特征响应,以实现更精细的特征选择与融合。最后,通过图像重建得到融合图像。在公开数据集MSRS和RoadScene进行了相关实验,结果表明,本文方法融合图像不仅在纹理细节保持与全局信息保留方面更完整,而且在客观指标中取得显著优势。该研究结果可为深度学习图像融合领域提供新的思路。

关键词: 红外与可见光图像融合泰勒展开网络 SwinTransformer 双分支特征提取复合注意力机制

DOI：10.11918/202509025

分类号:TP391；TN29

文献标识码:A

基金项目:国家自然科学基金(3,6)；甘肃省重点研发计划(25YFGA047)；甘肃省自然科学基金(23JRRA7,1JR7RA300)

Infrared and visible image fusion guided by Taylor expansion and composite attention

YANG Yanchun,LI Yi

(School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China)

Abstract:

In order to solve the problems of ignoring the correlation between pixels in the deep learning fusion algorithm, which leads to the loss of important global texture in the fusion results, and the difficulty of balancing target highlight and scene enhancement, this paper proposed an infrared and visible image fusion algorithm guided by Taylor expansion and composite attention mechanism. Firstly, a Taylor expansion network was designed to decomposition the input image into a mapping layer and a derivative layer, so as to effectively extract the multi-level feature information of the image. Secondly, a dual-branch feature extraction network was used, in which the parallel convolutional network was responsible for capturing local detail features, and the SwinTransformer module focused on extracting global context information to ensure the efficient retention of local and global features. Then, the composite attention mechanism is introduced to further improve the accuracy of feature fusion. This mechanism fuses spatial dimensional features through axial attention, and uses channel attention to strengthen the feature response between channels, so as to achieve more refined feature selection and fusion. Finally, the fused image was obtained by image reconstruction. Experiments are carried out on the public datasets MSRS and RoadScene. The results show that the proposed method is not only more complete in maintaining texture details and global information, but also achieves significant advantages in objective indicators. The research results can provide new ideas for the field of deep learning image fusion.

Key words: infrared and visible image fusion Taylor expansion network SwinTransformer dual-branch feature extraction compound attention mechanism

期刊检索

关键词检索

新闻公告MORE

友情链接LINKS