Multi-path collaboration-based and spatial-spectral prior-based hyperspectral and multispectral image fusion

Journal of Zhejiang University (Engineering Science)

2026, Vol. 60, Issue (7): 1427-1437

DOI: 10.3785/j.issn.1008-973X.2026.07.006

Computer and Control Engineering

Yanchun YANG, Jialong LI

School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China


Abstract:

A hyperspectral and multispectral image fusion method based on multi-path collaboration and spatial-spectral priors was proposed to address two problems in this task: insufficient global modeling and local detail capture, and the difficulty of exploring correlations between adjacent bands along the spectral dimension. Firstly, the backbone network placed a local bottleneck control unit and a Transformer in parallel. The local bottleneck control unit learned local structures while suppressing redundant features, whereas the Transformer handled long-range dependencies, and a bidirectional interactive fusion mechanism enhanced the understanding of both local details and global context. Secondly, the joint spatial-spectral prior module used a dual-path pooling strategy for spatial attention and an intra-spectral grouped attention mechanism to measure the degree of correlation between bands. Finally, the multi-path aggregation network integrated features through residual blocks and a layer-by-layer progressive fusion strategy. Experimental results showed that, on the CAVE dataset, the proposed method improved PSNR and SSIM by an average of 4.5% and 0.7%, respectively, over eight other methods, with clear advantages in capturing local and global features and in fusing spatial-spectral prior information.

Key words: hyperspectral and multispectral image fusion; local and global collaboration; Transformer; joint spatial and spectral priors; spectral grouping attention mechanism
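The PSNR figures quoted in the abstract can be made concrete with a minimal sketch. It assumes pixel values normalized to [0, 1], a PSNR averaged band by band, and a relative gain computed against a single baseline score; the abstract does not state which averaging convention the authors used, so the function names and conventions below are illustrative assumptions, not the paper's evaluation code.

```python
import math


def band_psnr(ref, est, peak=1.0):
    """PSNR in dB for one band, given two equal-length flat lists of pixel values.

    `peak` is the assumed maximum pixel value (1.0 for normalized data).
    """
    if len(ref) != len(est) or not ref:
        raise ValueError("bands must be non-empty and of equal length")
    mse = sum((r - e) ** 2 for r, e in zip(ref, est)) / len(ref)
    if mse == 0:
        return math.inf  # identical bands: PSNR is unbounded
    return 10.0 * math.log10(peak ** 2 / mse)


def mean_psnr(ref_bands, est_bands, peak=1.0):
    """Average the per-band PSNR over all spectral bands (assumed convention)."""
    if len(ref_bands) != len(est_bands) or not ref_bands:
        raise ValueError("band lists must be non-empty and of equal length")
    values = [band_psnr(r, e, peak) for r, e in zip(ref_bands, est_bands)]
    return sum(values) / len(values)


def relative_gain_percent(ours, baseline):
    """Relative improvement of `ours` over `baseline`, in percent."""
    return 100.0 * (ours - baseline) / baseline


if __name__ == "__main__":
    # Toy two-band image with two pixels per band (hypothetical values).
    reference = [[0.0, 1.0], [0.5, 0.5]]
    estimate = [[0.1, 0.9], [0.5, 0.4]]
    print(mean_psnr(reference, estimate))
    print(relative_gain_percent(41.8, 40.0))
```

Under these assumptions, a method scoring 41.8 dB against a baseline of 40.0 dB shows a relative PSNR gain of about 4.5%. These two scores are hypothetical and chosen only to illustrate the arithmetic.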

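Received: 2025-04-16; Published: 2026-05-23

CLC: TP 391

Funding: National Natural Science Foundation of China (62462043, 62067006); Key Research and Development Program of Gansu Province (25YFGA047); Natural Science Foundation of Gansu Province (23JRRA847, 21JR7RA300).

About the author: Yanchun YANG (born 1979), female, associate professor, engaged in research on image fusion and image processing. orcid.org/0009-0004-6106-9551. E-mail: yangyanchun102@sina.com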


Cite this article:

Yanchun YANG, Jialong LI. Multi-path collaboration-based and spatial-spectral prior-based hyperspectral and multispectral image fusion[J]. Journal of Zhejiang University (Engineering Science), 2026, 60(7): 1427-1437.

Link to this article:

https://www.zjujournals.com/eng/CN/10.3785/j.issn.1008-973X.2026.07.006 or https://www.zjujournals.com/eng/CN/Y2026/V60/I7/1427

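Fig. 1 Overall architecture of the fusion network

Fig. 2 Local bottleneck control unit and Transformer

Fig. 3 Spatial and spectral prior module

Fig. 4 Spectral grouped attention

Fig. 5 Experimental results on the CAVE dataset

Table 1 Mean values of evaluation metrics for the CAVE experimental results

Fig. 6 Experimental results on the Harvard dataset

Table 2 Mean values of evaluation metrics for the Harvard experimental results

Fig. 7 Experimental results on the Pavia University dataset

Table 3 Mean values of evaluation metrics for the Pavia University experimental results

Fig. 8 Visualization of the per-band PSNR of the experimental results

Fig. 9 Ablation experiment results

Table 4 Quantitative analysis of the ablation experiments

Table 5 Parameter count and computational complexity of each module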

